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Vaccines 

Field of the Invention 

5 The present invention relates to nucleic acid constructs, 
host cells comprising such constructs and their use in 
nucleic acid vaccines. The invention further relates to 
vaccine formulations comprising such constructs and the 
use of such formulations in medicine. The invention in 
10 particular relates to DNA vaccines that are useful in the 
prophylaxis and treatment of HIV infections, more 
particularly when administered by particle mediated 
delivery. 

15 Background to the Invention 

HIV-1 is the primary cause of the acquired immune 
deficiency syndrome (AIDS) which is regarded as one of 
the world's major health problems. Although extensive 
20 research throughout the world has been conducted to 

produce a vaccine, such efforts thus far have not been 
successful . 

Non- envelope proteins of HIV-1 have been described and 
25 include for example internal structure , proteins such as 
the products of the gag and pol genes and, other non- 
structural proteins such as Rev, Nef , Vif and Tat (Green 
et al.. New England J. Med, 324, 5, 308 et seq (1991) and 
Bryant et al . (Ed. Pizzo) , Pediatr. Infect. Dis. J., 11, 
30 5, 390 et seq (1992) . 

The Gag gene is translated from the full-length RNA to 
yield a precursor polyprotein which is subsequently 
cleaved into 3-5 capsid proteins; the matrix protein, 
35 capsid protein and nucleic acid binding protein and 

- 1 - 
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protease. (1, Fundamental Virology, Fields BN, Knipe DM 
and Howley M 1996 2. Fields Virology vol 2 1996) . 

The gag gene gives rise to the 55 -kilodal ton (kD) Gag 
5 precursor protein, also called p55, which is expressed 

from the unspliced viral mRNA. During translation, the N 
terminus of p55 is myristoylated, triggering its 
association with the cytoplasmic aspect of cell 
membranes. The membrane-associated Gag polyprotein 

10 recruits two copies of the viral genomic RNA along with 
other viral and cellular proteins that triggers the 
budding of the viral particle from the surface of an 
infected cell. After budding, p55 is cleaved by the 
virally encoded protease (a product of the pol gene) 

15 during the process of viral maturation into four smaller 
proteins designated MA (matrix [pl7l ) , CA (capsid [p24] ) , 
NC (nucleocapsid [p9] ) , and p6 , (4) 

In addition to the 3 major Gag protein, all Gag 
20 precursors contain several other regions, which are 
cleaved out and remain in the virion as peptides of 
various sizes. These proteins have different roles e.g. 
the p2 protein has a proposed role in regulating activity 
of the protease and contributes to the correct timing of 
25 proteolytic processing. 

The MA polypeptide is derived from the N- terminal, 
myristoylated end of p55. Most MA molecules remain 
attached to the inner surface of the virion lipid 

30 bilayer, stabilizing the particle. A subset of MA is 

recruited inside the deeper layers of the virion where it 
becomes part of the complex which escorts the viral DNA 
to the nucleus. (5) These MA molecules facilitate the 
nuclear transport of the viral genome because a 

35 karyophilic signal on MA is recognized by the cellular 
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nuclear import machinery. This phenomenon allows HIV to 
infect nondividing cells, an unusual property for a 
retrovirus - 

5 The p24 (CA) protein forms the conical core of viral 
particles. Cyclophilin A has been demonstrated to 
interact with the p24 region of p55 leading to its 
incorporation into HIV particles. The interaction 
between Gag and cyclophilin A is essential because the 
10 disruption of this interaction by cyclosporine A inhibits 
viral replication . 

The NC region of Gag is responsible for specifically 
recognizing the so-called packaging signal of HIV. The 

15 packaging signal consists of four stem loop structures 
located near the 5 ' end of the viral RNA, and is 
sufficient to mediate the incorporation of a heterologous 
RNA into HIV-1 virions. NC binds to the packaging signal 
through interactions mediated by two zinc-finger motifs. 

20 NC also facilitates reverse transcription. 

The p6 polypeptide region mediates interactions between 
p55 Gag and the accessory protein Vpr, leading to the 
incorporation of Vpr into assembling virions. The p6 
25 region also contains a so-called late domain which is 
required for the efficient release of budding virions 
from an infected cell 

The Pol gene encodes two proteins containing the two 
30 activities needed by the virus in early infection, the RT 
and the integrase protein needed for integration of viral 
DNA into cell DNA. The primary product of Pol is cleaved 
by the virion protease to yield the amino terminal RT 
peptide which contains activities necessary for DNA 
35 synthesis (RNA and DNA directed DNA polymerase, 

ribouclease H) and carboxy terminal integrase protein. 

-3- 
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HIV RT is a heterodimer of full-length RT (pG6) and a 
cleavage product (p51) lacking the carboxy terminal Rnase 
integrase domain. 

5 RT is one of the most highly conserved proteins encoded 
by the retroviral genome. Two major activities of RT are 
the DNA Pol and Ribonuclease H. The DNA Pol activity of 
RT uses RNA and DNA as templates interchangeably and like 
all DNA polymerases known is unable to initiate DNA 
10 synthesis de novo, but requires a pre existing molecule 
to serve as a primer (RNA) . 

The Rnase H activity inherent in all RT proteins plays 
the essential role early in replication of removing the 
15 RNA genome as DNA synthesis proceeds. It selectively 
degrades the RNA from all RNA - DNA hybrid molecules. 
Structurally the polymerase and ribo H occupy separate, 
non- overlapping domains with the Pol covering the amino 
two thirds of the Pol . 

20 

The p6 6 catalytic subunit is folded into 5 distinct 
subdomains- The amino terminal 23 of these have the 
portion with RT activity. Carboxy term to these is the 
Rnase H Domain. 

25 

After infection of the host cell, the retroviral RNA 
genome is copied into linear ds DNA by the reverse 
transcriptase that is present in the infecting particle. 
The integrase (reviewed in Skalka AM 99 Adv in Virus Res 

30 52 2 71-273) recognises the ends of the viral DNA, trims 
them and accompanies the viral DNA to a host chromosomal 
site to catalyse integration. Many sites in the host DNA 
can be targets for integration. Although the integrase 
is sufficient to catalyse integration in vitro, it is not 

35 the only protein associated with the viral DNA in vivo - 
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the large protein - viral DNA complex isolated from the 
infected cells has been denoted the pre integration 
complex. This facilitates the acquisition of the host 
cell genes by progeny viral genomes. 

5 

The integrase is made up of 3 distinct domains, the N 
terminal domain, the catalytic core and the c terjminal 
domain. The catalytic core domain contains all of the 
requirements for the chemistry of polynucleotidyl 
10 transfer. 

The Nef protein is known to cause the removal of CD4, the 
HIV receptor, from the cell surface, but the biological 
importance of this function is debated. Additionally Nef 

15 interacts with the signal pathway of T cells and induces 
an active state, which in turn may promote more efficient 
gene expression. Some HIV isolates have mutations in 
this region, which cause them not to encode functional 
protein and are severely compromised in their replication 

20 and pathogenesis in vivo. 

DNA vaccines usually consist of a bacterial plasmid 
vector into which is inserted a strong promoter, the gene 
of interest which encodes for an antigenic peptide and a 

25 polyadenylation/transcriptional termination sequences. 

The gene of interest may encode a full protein or simply 
an antigenic peptide sequence relating to the pathogen, 
tumour or other agent which is intended to be protected 
against. The plasmid can be grown in bacteria, such as 

30 for example E.coli and then isolated and prepared in an 
appropriate medium, depending upon the intended route of 
administration, before being administered to the host. 
Following administration the plasmid is taken up by cells 
of the host where the encoded peptide is produced. The 

35 plasmid vector will preferably be made without an origin 



-5- 



wo 03/025003 



PCT/EP02/10592 



of replication which is functional in eukaryotic cells, 
in order to prevent plasmid replication in the mammalian 
host and integration within chromosomal DNA of the animal 
concerned . 

5 

There are a number of advantages of DNA vaccination 
relative to traditional vaccination techniques. First, 
it is predicted that because of the proteins which are 
encoded by the DNA sequence are synthesised in the host, 

10 the structure or conformation of the protein will be 

similar to the native protein associated with the disease 
state. It is also likely that DNA vaccination will offer 
protection against different strains of a virus, by 
generating cytotoxic T lymphcyte response that recognise 

15 epitopes from conserved proteins. Furthermore, because 
the plasmids are taken up by the host cells where 
antigenic protein can be produced, a long-lasting immune 
response will be elicited. The technology also offers 
the possibility of combing diverse immunogens into a 

20 single preparation to facilitate simultaneous 

immunisation in relation to a number of disease states. 

Helpful background information in relation to DNA 
vaccination is provided in Donnelly et al ^'DNA vaccines" 
25 Ann. Rev Immunol. 1997 15: 617-648, the disclosure of 
which is included herein in its entirety by way of 
reference . 

Summary of the Invention 

30 

The present invention provides novel constructs for use 
in nucleic acid vaccines for the prophylaxis and 
treatment of HIV infections and AIDS. 
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Accordingly, in a first aspect, there is provided a 
nucleic acid molecule comprising a nucleotide sequence 
encoding HIV gag protein or fragment thereof linked to a 
nucleotide sequence encoding a further HIV antigen or 
5 fragment thereof and operably linked to a heterologous 
promoter. The fragment of said nucleotide sequence will 
encode an HIV epitope and typically encode a peptide of 
at least 8 amino acids. The nucleotide sequence is 
preferably a DNA sequence and is preferably contained 
10 within a plasmid without an origin of replication. Such 
nucleic acid molecules are formulated with 

pharmaceutically acceptable excipient, carriers, diluents 
or adjuvants to produce pharmaceutical composition 
suitable for the treatment and/or prophylaxis of HIV 
15 infection and AIDS. 

In a preferred embodiment the DNA sequence is formulated 
onto the surface of inert particles or beads suitable for 
particle mediated drug delivery. Preferably the beads are 
20 gold. 

In a preferred embodiment of the invention there is 
provided a DNA sequence that highly expressed codes for 
gag protein which sequence is optimised to resemble the 
25 codon usage of genes in mammalian cells. In particular, 
the gag protein is optimised to resemble that of highly 
expressed human genes. 

The DNA code has 4 letters (A, T, G and G) and uses these 
30 to spell three letter ^^codons'' which represent the amino 
acids the proteins encoded in an organism'^ s genes . The 
linear sequence of codons along the DNA molecule is 
translated into the linear sequence of amino acids in the 
protein (s) encoded by those genes. The code is highly 
35 degenerate, with 61 codons coding for the 2 0 natural 
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amino acids and 3 codons representing ^^stop'' signals. 
ThuS/ most amino acids are coded for by more than one 
codon - in fact several are coded for by four or more 
different codons. 

5 

Where more than one codon is available to code for a 
given amino acid, it has been observed that the codon 
usage patterns of organisms are highly non- random. 
Different species show a different bias in their codon 

10 selection and, furthermore, utilisation of codons may be 
markedly different in a single species between genes 
which are expressed at high and low levels. This bias is 
different in viruses, plants, bacteria and mammalian 
cells, and some species show a stronger bias away from a 

15 random codon selection than others. For example, humans 
and other mammals are less strongly biased than certain 
bacteria or viruses. For these reasons, there is a 
significant probability that a mammalian gene expressed 
in E . coli or a foreign or recombinant gene expressed in 

20 mammalian cells will have an inappropriate distribution 
of codons for efficient expression. It is believed that 
the presence in a heterologous DNA sequence of clusters 
of codons or an abundance of codons which are rarely 
observed in the host in which expression is to occur, is 

25 predictive of low heterologous expression levels in that 
host . 

In an embodiment of the present invention provides a gag 
polynucleotide sequence which encodes an amino acid 

30 sequence, wherein the codon usage pattern of the 
polynucleotide sequence resembles that of highly 
expressed mammalian genes. Preferably the polynucleotide 
sequence is a DNA sequence. Desirably the codon usage 
pattern of the polynucleotide sequence is typical of 

35 highly expressed human genes. 
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In the polynucleotides of the present invention, the 
codon usage pattern is altered from that typical of human 
immunodeficiency viruses to more closely represent the 
5 aodon bias of the target organism, e.g. a mammal, 

especially a human. The codon usage coefficient" is a 
measure of how closely the codon pattern of a given 
polynucleotide sequence resembles that of a target 
species. Codon frequencies can be derived from literature 

10 sources for the highly expressed genes of many species 
(see e.g. Nakamura et.al. Nucleic Acids Research 1996, 
24:214-215). The codon frequencies for each of the 61 
codons (expressed as the number of occurrences occurrence 
per 100 0 codons of the selected class of genes) are 

15 normalised for each of the twenty natural amino acids, so 
that the value for the most frequently used codon for 
each amino acid is set to 1 and the frequencies for the 
less common codons are scaled to lie between zero and 1 . 
Thus each of the 61 codons is assigned a value of 1 or 

20 lower for the highly expressed genes of the target 

species. In order to calculate a codon usage coefficient 
for a specific polynucleotide, relative to the highly 
expressed genes of that species, the scaled value for 
each codon of the specific polynucleotide are noted and 

25 the geometric mean of all these values is taken (by 

dividing the sum of the natural logs of these values by 
the total number of codons and take the anti-log) . The 
coefficient will have a value between zero and 1 and the 
higher the coefficient the more codons in the 

30 polynucleotide are frequently used codons. If a 

polynucleotide sequence has a codon usage coefficient of 
1, all of the codons are "most frequent" codons for 
highly expressed genes of the target species. 
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According to the present invention, the codon usage 
pattern of the polynucleotide will preferably exclude 
codons with an RSCU value of less than 0.2 in highly 
expressed genes of the target organism. Alternatively, 
5 the codon usage pattern will exclude codons representing 
<10% of the codons used for a particular amino acid. A 
relative synonymous codon usage (RSCU) value is the 
observed number of codons divided by the number expected 
if all codons for that amino acid were used equally 

10 frequently. A polynucleotide of the present invention 
will generally have a codon usage coefficient (or RSCU) 
for highly expressed human genes of greater than 0.3, 
preferably greater than 0.4, most preferably greater than 
0.5 Codon usage tables for human can also be found in 

15 Genebank. 

In comparison, a highly expressed beta actin gene has a 
RSCU of 0.747. The codon usage table for a homo sapiens 
is set out below: 

20 

Codon Usage Table 1 : 

Homo sapiens [gbpri] : 27143 CDS ' s (12816923 codons) 

25 fields: [triplet] [frequency: per thousand] ([number]) 





uau 


17 


• 0 (217684) 


UCU 


14 


. 8 (189419) 


UAU 


12 


. 1 (155645) 


UGU 


10 


. 0 (127719) 




uuc 


20 


.5 (262753) 


UCC 


17 


.5 (224470) 


UAC 


15 


.8 (202481) 


UGC 


12 


.3 (157257) 




UUA 


7 


.3( 93924) 


UCA 


11 


.9(152074) 


UAA 


0 


.7( 9195) 


UGA 


1 


.3( 16025) 


30 


U0G 


12 


.5 (159611) 


UCG 


4 


.5( 57572) 


UAG 


0 


.5( 6789) 


UGG 


12 


. 9 (165930) 




cuu 


12 


.8 (163707) 


ecu 


17 


.3 (222146) 


CAU 


10 


.5 (134186) 


CGU 


4 


. 6( 59454) 




cue 


19 


.3 (247391) 


ccc 


20 


.0 (256235) 


CAC 


14 


.9 (190928) 


CGC 


10 


. 8 (137865) 




CUA 


7 


.0( 89078) 


CCA 


16 


. 7 (214583) 


CAA 


12 


. 0 (153590) 


CGA 


6 


. 3 ( 80709) 


35 


CUG 


39 


.7 (509096) 


CCG 


7 


.0( 89619) 


CAG 


34 


.5 (441727) 


CGG 


11 


.6 (148666) 




AUU 


15 


. 8 (202844) 


ACU 


12 


.9(165392) 


AAU 


17 


. 0 (218508) 


AGU 


12 


. 0 (154442) 




AUC 


21 


.6 (277066) 


ACC 


19 


.3 (247805) 


AAC 


19 


. 8 (253475) 


AGC 


19 


. 3 (247583) 




AUA 


7 


.2( 92133) 


ACA 


14 


.9 (191518) 


AAA 


24 


. 0 (308123) 


AGA 


11 


.5 (147264) 


40 


AUG 


22 


.3 (285776) 


ACG 


6 


.3( 80369) 


AAG 


32 


.6 (418141) 


AGG 


11 


.3 (145276) 
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GUU 10.9(139611) GCU 18.5(236639) GAU 22.4(286742) GGU 10.8(138606) 

GUC 14.6(187333) GCC 28.3(362086) GAG 26.1(334158) GGC 22.7(290904) 

GUA 7.0( 89644) GCA 15.9(203310) GAA 29.1(373151) GGA 16.4(210643) 

GUG 28.8(369006) GCG 7.5 { 96455) GAG 40.2 (515485) GGG 16.4 (209907) 



Coding GC 52.51% 1st letter GC 56.04% 2nd letter GC 
42.35% 3rd letter GC 59.13% 



10 Codon Usage Table 2 (preferred) : 

Codon usage for human (highly expressed) genes 1/24/91 

(human_high . cod) 

AmAcid Codon Number /lOOO Fraction 



15 


Gly 


GGG 


905 . 00 


18 . 76 


0 . 24 




Gly 


GGA 


525. 00 


10 . 88 


0 .14 




Glv 


GGT 


441 . 00 


9 . 14 


0 • 12 




Glv 


GGC 


1867.00 


38.70 


0 . 50 


20 


Glu 


GAG 


2420 . 00 


50 . 16 


0 .75 




Glu 


GAA 


792 ,00 


16 .42 


0 .25 






GAT 


592 .00 


12 . 27 


0 , 25 






GAC 


1821 00 


37 . 75 


0 . 75 


25 


Val 


GTG 


1866 . 00 


38 . 68 


0 .64 




Val 


GTA 


134.00 


2 . 78 


0 .05 




V CL-L 


\jr± J. 


1 Q A no 




0 07 




Val 


GTC 


728 . 00 


15 . 09 


0.25 


30 


Ala 


GCG 


652 . 00 


13 . 51 


0 - 17 




Ala 


GCA 


488 . 00 


10 . 12 


0 . 13 




Ala 


GCT 


654 . 00 


13-56 


0.17 




Ala 


GCC 


2057 . 00 


42 . 64 


0 . 53 


35 


Arg 


AGG 


512 . 00 


10.61 


0.18 




Arg 


AGA 


298 . 00 


6 . 18 


0 , 10 




Ser 


AGT 


354 .00 


7 . 34 


0 . 10 




Ser 


AGC 


1171 . 00 


24 .27 


0.34 


40 


Lys 


AAG 


2117 .00 


43 . 88 


0.82 




Lys 




471 . 00 


9 . 76 


0 . 18 




Asn 


AAT 


314 . 00 


6.51 


0 .22 




Asn 


AAC 


1120 . 00 


23 -22 


0 , 78 


45 


Met 


ATG 


1077 . 00 


22 .32 


1 . 00 




lie 


ATA 


88 - 00 


1.82 


0 . 05 




lie 


ATT 


315 . 00 


6 . 53 


0 . 18 




lie 


ATC 


1369 . 00 


28.38 


0 . 77 


50 


Thr 


ACG 


405 . 00 


8.40 


0 . 15 
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Thr 




373 . 00 


7. 73 


0 . 14 




Thr 


ACT 


358 . GO 


7 .42 


0 . 14 




J. XXJu 




1 R 09 on 






5 


Trp 


TGG 


652 . 00 


13 .51 


1 . 00 




End 


TGA 


109 .00 


2.26 


0 .55 




Cys 


TGT 


325. 00 


6. 74 


0.32 








n Of: no 






10 


End 


TAG 


42 . 00 


0-87 


0 .21 




End 


TAA 


46 . 00 


0 . 95 


0 . 23 




Tyr 


TAT 


360 - 00 


7.46 


0.26 




lyr 




1 n A o n n 






15 


Leu 


TTG 


313 . 00 


6 .49 


0 . 06 




Leu 


TTA 


76 . 00 


1 . 58 


0 . 02 




Phe 


TTT 


336.00 


6 . 96 


0 .20 




Jriie 


"TTP 
X Xv_ 


1 7 '7 on 




0 ft n 

VJ • O VJ 


20 


Ser 


TCG 


325 . 00 


6.74 


0 . 09 




Ser 


TCA 


165 . 00 


3 .42 


0 . 05 




Ser 


TCT 


450 .00 


9 . 33 


0 . 13 




O £i -y^ 


TPP 
X ^ V— 


Q c: Q on 


1 Q ftfn 

X ^ . O w 


0 2 8 


25 


Arg 


CGG 


611 . 00 


12 . 67 


0 .21 




Arg 


CGA 


183 . 00 


3 . 79 


0 . 06 




Arg 


CGT 


210 .00 


4 .35 


0 . 07 




i\jrg 


ppp 


1 o ft n n 

X U O O . \J \J 


9 9 m 


n "^7 


30 


Gin 


GAG 


2020 . 00 


41.87 


0 . 88 




Gin 


CAA 


283 . 00 


5 . 87 


0 . 12 




His 


CAT 


234 . 00 


4 . 85 


0 . 21 




JtlXS 




Q '7 fi no 


1 ft n*^ 

X O ■ VJ o 


0 7 9 


35 


Leu 


CTG 


2884 . 00 


59.78 


0 . 58 




Leu 


CTA 


166 . 00 


3 .44 


0 . 03 




Leu 


CTT 


238 . 00 


4 . 93 


0 . 05 




Leu 


CTC 


1276 - 00 


26 .45 


0.26 


40 


Pro 


CCG 


482 .00 


9 . 99 


0 . 17 




Pro 


CCA 


456 . 00 


■ 


0 . 16 




Pro 


CCT 


568 . 00 


11.77 


0 . 19 




Pro 


CCC 


1410 . 00 


29 .23 


0.48 



45 According to a further aspect of the invention, an 
expression vector is provided which comprises and is 
capable of directing the expression of a polynucleotide 
sequence according to the first aspect of the invention, 
in particular the codon usage pattern of the gag 
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polynucleotide sequence is typical of highly expressed 
mammalian genes, preferably highly expressed human genes. 
The vector may be suitable for driving expression of 
heterologous DNA in bacterial insect or mammalian cells, 
5 particularly human cells. In one embodiment, the 
expression vector is p7313 (see figure 1) . 

In a third embodiment there is provided a gag gene under 
the control of a heterologous promoter fused to a DNA 
10 sequence encoding NEF, a fragment thereof, or HIV Reverse 
Transcriptase (RT) or fragment thereof . The gag portion 
of the gene may be either the N or C terminal portion of 
the fusion. 

15 In a preferred embodiment, the gag gene does not encode 

the gag p6 peptide. Preferably the NEF gene is truncated 
to remove the sequence encoding the N terminal region 
i.e. removal of 3 0-85, preferably 60-85, typically about 
81, preferably the N terminal 65 amino acids. 

20 

In a further embodiment the RT gene is also optimised to 
resemble a highly expressed human gene. The RT 
preferably encodes a mutation to substantially inactivate 
any reverse transcriptase activity. A preferred 
25 inactivation mutation involves the substitution of W 
tryptophan 22 9 for K (lysine) . 

According to a further aspect of the invention, a host 
cell comprising a polynucleotide sequence according to 
30 the invention, or an expression vector according to the 
invention is provided. The host cell may be bacterial, 
e.g. E.coli, mammalian, e.g. human, or may be an insect 
cell. Mammalian cells comprising a vector according to 
the present invention may be cultured cells transfected 
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in vitro or may be transfected in vivo by administration 
of the vector to the mammal . 

The present invention further provides a pharmaceutical 
5 composition comprising a polynucleotide sequence 
according to the invention. Preferably the composition 
comprises a DNA vector. In preferred embodiments the 
composition comprises a plurality of particles, 
preferably gold particles, coated with DNA comprising a 

10 vector encoding a polynucleotide sequence of the 
invention. Preferably the sequence encodes an HIV gag 
amino acid sequence, wherein the codon usage pattern of 
the polynucleotide sequence is typical of highly 
expressed mammalian genes, particularly human genes. In 

15 alternative embodiments, the composition comprises a 
pharmaceutically acceptable excipient and a DNA vector 
according to the second aspect of the present invention. 
The composition may also include an adjuvant. 

20 Thus it is an embodiment of the invention that the 
vectors of the invention be utilised with 
immunostimulatory agents. Preferably the 

immunostimulatory agent are administered at the same time 
as the nucleic acid vector of the invention and in 

25 preferred embodiments are formulated together. Such 

immunostimulatory agents include, but this list is by no 
means exhaustive and does not preclude other agents: 
synthetic imidazoquinolines such as imiquimod [S-26308, 
R-837], (Harrison, et al. ^Reduction of recurrent HSV 

30 disease using imiquimod alone or combined with a 

glycoprotein vaccine''. Vaccine 19: 1820-1826, (2001)); 
and resiquimod [S-28463, R-848] (Vasilakos, et al . ^ 
Adjuvant activites of immune response modifier R-848: 
Comparison with CpG ODN' , Cellular immunology 204: 64-74 

35 (2000)-), Schiff bases of carbonyls and amines that are 
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constitutively expressed on antigen presenting cell and 
T-cell surfaces, such as tucaresol (Rhodes, J. et al. ^ 
Therapeutic potentiation of the immune system by 
costimulatory Schif f-base-f orming drugs'. Nature 377: 71- 
5 75 (1995) ) , cytokine, chemokine and co-stimulatory 
molecules as either protein or peptide, this would 
include pro -inflammatory cytokines such as GM-CSF, IL-1 
alpha,. lL-1 beta, TGF- alpha and TGF - beta, Thl inducers 
such as interferon gamma, IL-2, lL-12, IL-15 and IL-18, 

10 Th2 inducers such as IL-4, IL-5, IL-6, IL-10 and IL-13 

and other chemokine and co- stimulatory genes such as MCP- 
1, MIP-1 alpha, MIP-1 beta, RANTES, TCA-3, CD80, CD86 and 
CD40L, , other immunostimulatory targeting ligands such 
as CTLA-4 and L-selectin, apoptosis stimulating proteins 

15 and peptides such as Fas, (49) , synthetic lipid based 

adjuvants, such as vaxfectin, (Reyes et al., ^Vaxfectin 
enhances antigen specific antibody titres and maintains 
Thl type immune responses to plasmid DNA immunization' , 
Vaccine 19: 3778^3786) squalene, alpha- tocopherol, 

20 polysorbate 80, DOPC and cholesterol, endotoxin, [LPS], 
Beutler, B., ^Endotoxin, ^Toll-like receptor 4, and the 
afferent limb of innate immunity' , Current Opinion in 
Microbiology 3: 23-30 (2000)) ; CpG oligo- and di- 
nucleotides, Sato, Y. et al . , ^Immunostimulatory DNA 

25 sequences necessary for effective intradermal gene 

immunization''. Science 273 (5273): 352-354 (1996). Hemmi, 
H. et al-, ^A Toll-like receptor recognizes bacterial 
DNA', Nature 408: 740-745, (2000) and other potential 
ligands that trigger Toll receptors to produce Thl- 

30 inducing cytokines, such as synthetic Mycobacterial 

lipoproteins. Mycobacterial protein pl9, peptidoglycan, 
teichoic acid and lipid A. 

Certain preferred adjuvants for eliciting a predominantly 
35 Thl -type response include, for example, a Lipid A 
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derivative such as monophosphoryl lipid A, or preferably 
3-de-O-acylated monophosphoryl lipid A. MPL® adjuvants 
are available from Corixa Corporation (Seattle, WA; see, 
for example, US Patent Nos . 4,43 6,72 7; 4,8 77,611; 
5 4,866,034 and 4,912,094). CpG~ containing 

oligonucleotides (in which the CpG dinucleotide is 
unmethylated) also induce a predominantly Thl response. 
Such oligonucleotides are well known and are described, 
for example, in WO 96/02555, WO 99/33488 and U.S. Patent 

10 Nos. 6,008,200 and 5,856,462. Immunostimulatory DNA 
sequences are also described, for example, by Sato et 
al . , Science 273:352, 1996. Another preferred adjuvant 
comprises a saponin, such as Quil A, or derivatives 
thereof, including QS21 and QS7 (Aquila 

15 Biopharmaceuticals Inc., Framingham, MA); Escin; 

Digitonin; or Gypsophila. or Chenopodium quinoa. saponins - 

Also provided are the use of a polynucleotide according 
to the invention, or of a vector according to the 
20 invention, in the treatment or prophylaxis of an HIV 
inf ection. 

The present invention also provides methods of treating 
or preventing HIV infections, any symptoms or diseases 

25 associated therewith, comprising administering an 

effective amount of a polynucleotide, a vector or a 
pharmaceutical composition according to the invention. 
Administration of a pharmaceutical composition may take 
the form of one or more individual doses, for example as 

30 repeat doses of the same DNA plasmid, or in a "prime- 
boost" therapeutic vaccination regime. In certain cases 
the "prime" vaccination may be via particle mediated DNA 
delivery of a polynucleotide according to the present 
invention, preferably incorporated into a plasmid- derived 

35 vector and the "boost" by administration of a recombinant 
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viral vector comprising the same polynucleotide sequence, 
or boosting with the protein in adjuvant. Conversly the 
priming may be with the viral vector or with a protein 
formulation typically a protein formulated in adjuvant 
5 and the boost a DNA vaccine of the present invention. 
Multiple doses of prime and/or boost may be employed. 

In embodiments of the invention fragments of gag, nef or 
RT proteins are contemplated. For example, a 

10 polynucleotide of the invention may encode a fragment of 
an HIV gag, nef or RT protein. A polynucleotide which 
encodes a fragment of at least 8, for example 8-10 amino 
acids or up to 20, 50, 60, 70, 80, 100, 150 or 200 amino 
acids in length is considered to fall within the scope of 

15 the invention as long as the encoded oligo or polypeptide 
demonstrates HIV antigenicity. In particular, but not 
exclusively, this aspect of the invention encompasses the 
situation when the polynucleotide encodes a fragment of a 
complete HIV protein sequence and may represent one or 

20 more discrete epitopes of that protein- Such fragments 

may be codon optimised such that the fragment has a codon 
usage pattern which resembles that of a highly expressed 
mammalian gene. 

25 Preferred constructs according to the present invention 
include : 

1. pl7, p24, fused to truncated NEF (devoid of 
nucleotides encoding terminal amino -acids 1-65) 

30 

2. pl7, p24, RT, truncated NEF (devoid of nucleotides 
encoding terminal amino-acids 1-65) 

3. pl7, p24 (optimised gag) truncated NEF (devoid of 
35 nucleotides encoding terminal amino-acids 1-65) 
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5 



pl7, p24 (optimised gag) RT (optimised) truncated 
NEF (devoid of nucleotides encoding terminal amino- 
acids 1-85) 



5. pl7, p24, RT (optimised) truncated NEF (devoid of 
nucleotides encoding terminal amino-acids 1-65) 

6. Truncated NEF — (devoid of nucleotide 1-65) fused to 
10 optimised pl7, p24 gag. 

7 . Particularly preferred constructs of the invention 
include triple fusions RT-NEF-Gag, and RT-Gag-Nef 
particularly : 

15 

8. Optimised RT, truncated NEF and optimised P17, p24 
(gag) (RNG) 

and 

9. Optimised RT, optimised pl7, 24 (gag), Nef truncate 
20 (devoid of aa 1-65) RGN 



It is preferred that the HIV constructs are derived from 
an HIV Glade B or Glade G, particularly clade B. 



25 As discussed above, the present invention includes 

expression vectors that comprise the nucleotide sequences 
of the invention. Such expression vectors are routinely 
constructed in the art of molecular biology and may for 
example involve the use of plasmid DNA and appropriate 

30 initiators, promoters, enhancers and other elements, such 
as for example polyadenylation signals which may be 
necessary, and which are positioned in the correct 
orientation, in order to allow for protein expression. 
Other suitable vectors would be apparent to persons 

35 skilled in the art - By way of further example in this 
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regard we refer to Sambrook et al. Molecular Cloning: a 
Laboratory Manual. 2^^ Edition. CSH Laboratory Press. 
(1989) . 

5 Preferably, a polynucleotide of the invention, or for use 
in the invention in a vector, is operably linked to a 
control sequence which is capable of providing for the 
expression of the coding sequence by the host cell, i.e. 
the vector is an expression vector. The term operably 

10 linked'' refers to a juxtaposition wherein the components 
described are in a relationship permitting them to 
function in their intended manner. A regulatory 
sequence, such as a promoter, ^""operably linked'' to a 
coding sequence is positioned in such a way that 

15 expression of the coding sequence is achieved under 
conditions compatible with the regulatory sequence. 

The vectors may be, for example, plasmids, artificial 
chromosomes (e.g. BAG, PAC, YAC) , virus or phage vectors 

20 provided with a origin of replication, optionally a 

promoter for the expression of the polynucleotide and 
optionally a regulator of the promoter- The vectors may 
contain one or more selectable marker genes, for example 
an ampicillin or kanamycin resistance gene in the case of 

25 a bacterial plasmid or a resistance gene for a fungal 
vector. Vectors may be used in vitro, for example for 
the production of DNA or RNA or used to transfect or 
transform a host cell, for example, a mammalian host cell 
e.g. for the production of protein encoded by the vector. 

30 The vectors may also be adapted to be used in vivo, for 
example in a method of DNA vaccination or of gene 
therapy . 

Promoters and other expression regulation signals may be 
35 selected to be compatible with the host cell for which 
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expression is designed. For example, mammalian promoters 
include the metallothionein promoter, which can be 
induced in response to heavy metals such as cadmium, and 

the p~actin promoter. Viral promoters such as the SV40 
5 large T antigen promoter, human cytomegalovirus (CMV) 
immediate early (IE) promoter, rous sarcoma virus LTR 
promoter, adenovirus promoter, or a HPV promoter, 
particularly the HPV upstream regulatory region (URR) may 
also be used. All these promoters are well described and 
10 readily available in the art . 

A preferred promoter element is the CMV immediate early 
promoter, devoid of intron A but including exon 1. The 
promoter element may be the minimal promoter element or 
15 the enhanced promoter, the enhanced promoter being 
preferred. Accordingly there is provided a vector 
comprising a polynucleotide of the invention under the 
control of HCMV IE early promoter. 

20 Examples of suitable viral vectors include herpes simplex 
viral vectors, vaccinia or alpha-virus vectors and 
retroviruses, including lentiviruses, adenoviruses and 
adeno-associated viruses. Gene transfer techniques using 
these viruses are known to those skilled in the art. 

25 Retrovirus vectors for example may be used to stably 

integrate the polynucleotide of the invention into the 
host genome, although such recombination is not 
preferred. Replication-defective adenovirus vectors by 
contrast remain episomal and therefore allow transient 

30 expression. Vectors capable of driving expression in 

insect cells (for example baculovirus vectors) , in human 
cells, in yeast or in bacteria may be employed in order 
to produce quantities of the HIV protein encoded by the 
polynucleotides of the present invention, for example for 

35 use as subunit vaccines or in immunoassays. 
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The polynucleotides according to the invention have 
utility in the production by expression of the encoded 
proteins, which expression may take place in vitro ^ in 
5 vivo or vivo. The nucleotides may therefore be 

involved in recombinant protein synthesis, for example to 
increase yields, or indeed may find use as therapeutic 
agents in their own right, utilised in DNA vaccination 
techniques. Where the polynucleotides of the present 

10 invention are used in the production of the encoded 

proteins in vitro or ex vivo, cells, for example in cell 
culture, will be modified to include the polynucleotide 
to be expressed. Such cells include transient, or 
preferably stable mammalian cell lines. Particular 

15 examples of cells which may be modified by insertion of 
vectors encoding for a polypeptide according to the 
invention include mammalian HEK293T, CHO, HeLa, 293 and 
COS cells. Preferably the cell line selected will be one 
which is not only stable, but also allows for mature 

20 glycosylation and cell surface expression of a 

polypeptide. Expression may be achieved in transformed 
oocytes. A polypeptide may be expressed from a 
polynucleotide of the present invention, in cells of a 
transgenic non-human animal, preferably a mouse. A 

25 transgenic non-human animal expressing a polypeptide from 
a polynucleotide of the invention is included within the 
scope of the invention. 

The invention further provides a method of vaccinating a 
30 mammalian subject which comprises administering thereto 
an effective amount of such a vaccine or vaccine 
composition. Most preferably, expression vectors for use 
in DNA vaccines, vaccine compositions and 
immunotherapeutics will be plasmid vectors . 

35 
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DNA vaccines may be administered .in the form of "naked 
DNA""', for example in a liquid formulation administered 
using a syringe or high pressure jet, or DNA formulated 
with liposomes or an irritant transfection enhancer, or 
5 by particle mediated DNA delivery (PMDD) . All of these 
delivery systems are well known in the art. The vector 
may be introduced to a mammal for example by means of a 
viral vector delivery system. 

10 The compositions of the present invention can be 

delivered by a number of routes such as intramuscularly, 
subcutaneously, intraperitonally , intravenously or 
mucosally . 

15 In a preferred embodiment, the composition is delivered 
intradermal ly . In particular, the composition is 
delivered by means of a gene gun particularly particle 
bombardment administration techniques which involve 
coating the vector on to a bead (eg gold) which are then 

20 administered under high pressure into the epidermis; such 
as, for example, as described in Haynes et al, J 
Biotechnology 44: 37-42 (1996). 

In one illustrative example, gas -driven particle 
25 acceleration can be achieved with devices such as those 
manufactured by Powder ject Pharmaceuticals PLC (Oxford, 
UK) and Powder ject Vaccines Inc. (Madison, WI) , some 
examples of which are described in U.S. Patent Nos . 
5,846,796; 6,010,478; 5,865,796; 5,584,807; andEP Patent 
30 No. 0500 799. This approach offers a needle-free 

delivery approach wherein a dry powder formulation of 
microscopic particles, such as polynucleotide, are 
accelerated to high speed within a helium gas jet 
generated by a hand held device, propelling the particles 
35 into a target tissue of interest, typically the skin. 
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The particles are preferably gold beads of a 0.4 - 4.0 
^uri;. more preferably 0-6 - 2.0 [am diameter and the DNA 
conjugate coated onto these and then encased in a 
cartridge or cassette for placing into the ^^gene gun'"'- 

5 

In a related embodiment , other devices and methods that 
may be useful for gas-driven needle-less injection of 
compositions of the present invention include those 
provided by Bioject, Inc. (Portland, OR), some examples 
10 of which are described in U.S. Patent Nos . 4,790,824; 

5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 and 
5, 993, 412 . 

The vectors which comprise the nucleotide sequences 
15 encoding antigenic peptides are administered in such 
amount as will be prophylactically or therapeutically 
effective. The quantity to be administered, is generally 
in the range of one picogram to 1 milligram, preferably 1 
picogram to 10 micrograms for particle-mediated delivery, 
20 and 100 nanograms to 1 milligram, preferably 10 
micrograms to 1 milligram, for other routes, of 
nucleotide per dose. The exact quantity may vary 
considerably depending on the weight of the patient being 
immunised and the route of administration, 

25 

It is possible for the immunogen component comprising the 
nucleotide sequence encoding the antigenic peptide, to be 
administered on a once off basis or to be administered 
repeatedly, for example, between 1 and 7 times, 

30 preferably between 1 and 4 times, at intervals between 

about 1 day and about 18 months. However, this treatment 
regime will be significantly varied depending upon the 
size the patient concerned, the amount of nucleotide 
sequence administered, the route of administration, and 

35 other factors which would be apparent to a skilled 
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veterinary or medical practitioner. The patient may 
receive one or more other anti HIV retroviral drugs as 
part of their overall treatment regime. Additionally the 
nucleic acid immunogen may be administered with an 
5 ad j uvant . 

The adjuvant component specified herein can similarly be 
administered via a variety of different administration 
routes, such as for example, via the oral, nasal, 

10 pulmonary, intramuscular, subcutaneous, intradermal or 
topical routes. Preferably, the adjuvant component is 
administered via the intradermal or topical routes. Most 
preferably by the topical route. This administration may 
take place between about 14 days prior to and about 14 

15 days post administration of the nucleotide sequence, 

preferably between about Iday prior to and about 3 days 
post administration of the nucleotide sequence. The 
adjuvant component is, in an embodiment, administered 
substantially simultaneously with the administration of 

20 the nucleotide sequence. By ^^substantially simultaneous" 
what is meant is that administration of the adjuvant 
component is preferably at the same time as 
administration of the nucleotide sequence, or if not, at 
least within a few hours either side of nucleotide 

25 sequence administration. In the most preferred treatment 
protocol, the adjuvant component will be administered 
substantially simultaneously to administration of the 
nucleotide sequence- Obviously, this protocol can be 
varied as necessary, in accordance with the type of 

30 variables referred to above. It is preferred that the 
adjuvant is a IH - imidazo [4,5c] quinoline - 4 - amine 
derivative such as imiquimod. Typically imiquimod will 
be presented as a topical cream formulation and will be 
administered according to the above protocol . 

35 
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Once again, depending upon such variables, the dose of 
administration of the derivative will also vary, but may, 
for example, range between about 0 . Img per kg to about 
lOOmg per kg, where ^'per kg'' refers to the body weight of 
5 the mammal concerned. This administration of the.lH- 
imidazo [4 , 5-c] quinolin-4 -amine derivative would 
preferably be repeated with each subsequent or booster 
administration of the nucleotide sequence. Most 
preferably, the administration dose will be between about 
10 Img per kg to about 50mg per kg. In the case of a ^^prim- 
boost" scheme as described herein, the imiquimod or other 
IH-imidazo [4 , 5-c] quinolin-4 -amine derivative may be 
administered with either the prime or the boost or with 
both the prime and the boost . 

15 

While it is possible for the adjuvant component to 
comprise only IH-imidazo [4 , 5-c] quinolin-4 -amine 
derivatives to be administered in the raw chemical state, 
it is preferable for administration to be in the form of 

20 a pharmaceutical formulation. That is, the adjuvant 

component will preferably comprise the IH-imidazo [4 , 5- 
c] quinolin--4-amine combined with one or more 
pharmaceutically acceptable carriers, and optionally 
other therapeutic ingredients. The carrier (s) must be 

25 ^^acceptable" in the sense of being compatible with other 
ingredients within the formulation, and not deleterious 
to the recipient thereof. The nature of the formulations 
will naturally vary according to the intended 
administration route, and may be prepared by methods well 

30 known in the pharmaceutical art. All methods include the 
step of bringing into association a IH-imidazo [4 , 5- 
c] quinolin-4 -amine derivative with an appropriate carrier 
or carriers. In general, the formulations are prepared 
by uniformly and intimately bringing into association the 

35 derivative with liquid carriers or finely divided solid 
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carriers, or both, and then, if necessary, shaping the 
product into the desired formulation. Formulations of 
the present invention suitable for oral administration 
may be presented as discrete units such as capsules, 
5 cachets or tablets each containing a pre -determined 

amount of the active ingredient; as a powder or granules; 
as a solution or a suspension in an aqueous liquid or a 
non-aqueous liquid; or as an oil-in-water liquid emulsion 
or a water-in-oil emulsion. The active ingredient may 
10 also be presented as a bolus, electuary or paste. 

A tablet may be made by compression or moulding, 
optionally with one or more accessory ingredients. 
Compressed tablets may be prepared by compressing in a 

15 suitable machine the active ingredient in a free- flowing 
form such as a powder or granules, optionally mixed with 
a binder, lubricant, inert diluent, lubricating, surface 
active or dispersing agent- Moulded tablets may be made 
by moulding in a suitable machine a mixture of the 

20 powdered compound moistened with an inert liquid diluent. 

The tablets may optionally be coated or scored and may be 
formulated so as to provide slow or controlled release of 
the active ingredient . 

25 

Formulations for injection via, for example, the 
intramuscular, intraperitoneal, or subcutaneous 
administration routes include aqueous and non- aqueous 
sterile injection solutions which may contain 

30 antioxidants, buffers, bacteriostats and solutes which 
render the formulation isotonic with the blood of the 
intended recipient; and aqueous and non-aqueous sterile 
suspensions which may include suspending agents and 
thickening agents. The formulations may be presented in 

35 unit -dose or multi-dose containers, for example, sealed 
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ampoules and vials, and may be stored in a freeze-dried 
(lyophilised) condition requiring only the addition of 
the sterile liquid carrier, for example, water for 
injections, immediately prior to use. Extemporaneous 
5 injection solutions and suspensions may be prepared from 
sterile powders, granules and tablets of the kind 
previously described- Formulations suitable for 
pulmonary administration via the buccal or nasal cavity 
are presented such that particles containing the active 

10 ingredient, desirably having a diameter in the range of 
0.5 to 7 microns, are delivered into the bronchial tree 
of the recipient. Possibilities for such formulations 
are that they are in the form of finely comminuted 
powders which may conveniently be presented either in a 

15 piercable capsule, suitably of, for example, gelatine, 
for use in an inhalation device, or alternatively, as a 
self-propelling formulation comprising active ingredient, 
a suitable liquid propellant and optionally, other 
ingredients such as surfactant and/or a solid diluent, 

20 Self-propelling formulations may also be employed wherein 
the active ingredient is dispensed in the form of 
droplets of a solution or suspension. Such self- 
propelling formulations are analogous to those known in 
the art and may be prepared by established procedures. 

25 They are suitably provided with either a manually- 
operable or automatically functioning valve having the 
desired spray characteristics; advantageously the valve 
is of a metered type delivering a fixed volume, for 
example, 50 to 100 iiL, upon each operation thereof. 

30 

In a further possibility, the adjuvant component may be 
in the form of a solution for use in an atomiser or 
nebuliser whereby an accelerated airstream or ultrasonic 
agitation is employed to produce a find droplet mist for 
35 inhalation - 
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Formulations suitable for intranasal administration 
generally include presentations similar to those 
described above for pulmonary administration, although it 
5 is preferred for such formulations to have a particle 
diameter in the range of about 10 to about 2 00 microns, 
to enable retention within the nasal cavity. This may be 
achieved by, as appropriate, use of a powder of a 
suitable particle size, or choice of an appropriate 

10 valve. Other suitable formulations include coarse 

powders having a particle diameter in the range of about 
2 0 to about 500 microns, for administration by rapid 
inhalation through the nasal passage from a container 
held close up to the nose, and nasal drops comprising 

15 about 0,2 to 5% w/w of the active ingredient in aqueous 

or oily solutions. In one embodiment of the invention, it 
is possible for the vector which comprises the nucleotide 
sequence encoding the antigenic peptide to be 
administered within the same formulation as the IH- 

20 imidazo [4 , 5-c] quinolin-4-amine derivative. Hence in this 
embodiment, the immunogenic and the adjuvant component 
are found within the same formulation. 

In an embodiment the adjuvant component is prepared in a 
25 form suitable for gene-gun administration, and is 

administered via that route substantially simultaneous to 
administration of the nucleotide sequence. For 
preparation of formulations suitable for use in this 
manner, it may be necessary for the IH-imidazo [4 , 5- 
30 c] quinolin-4 -amine derivative to be lyophilised and 

adhered onto, for example, gold beads which are suited 
for gene -gun administration. 
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In an alternative embodiment, the adjuvant component may 
be administered as a dry* powder, via high pressure gas 
propulsion. 

5 Even if not formulated together, it may be appropriate 

for the adjuvant component to be administered at or about 
the same administration site as the nucleotide sequence. 

Other details of pharmaceutical preparations can be found 
10 in Remington^ s Pharmaceutical Sciences, Mack Publishing 
Company, Easton, Pennysylvania (1985) , the disclosure of 
which is included herein in its entirety, by way of 
reference . 

15 Suitable techniques for introducing the naked 

polynucleotide or vector into a patient also include 
topical application with an appropriate vehicle. The 
nucleic acid may be administered topically to the skin, 
or to mucosal surfaces for example by intranasal, oral, 

20 intravaginal or intrarectal administration. The naked 
polynucleotide or vector may be present together with a 
pharmaceutically acceptable excipient, such as phosphate 
buffered saline (PBS) . DNA uptake may be further 
facilitated by use of facilitating agents such as 

25 bupivacaine, either separately or included in the DNA 

formulation. Other methods of administering the nucleic 
acid directly to a recipient include ultrasound, 
electrical stimulation, electroporation and microseeding 
which is described in US-5 , 697 , 901 . 

30 

Uptake of nucleic acid constructs may be enhanced by 
several known transfection techniques, for example those 
including the use of transfection agents. Examples of 
these agents includes cationic agents, for example, 
35 calcium phosphate and DEAE-Dextran and lipof ectants , for 
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example, lipofectam and transf ectatn. The dosage of the 
nucleic acid to be administered can be altered. 

A nucleic acid sequence of the present invention may also 
5 be administered by means of specialised delivery vectors 
useful in gene therapy. Gene therapy approaches are 
discussed for example by Verme et al, Nature 1997, 
389:239-242. Both viral and non-viral vector systems can 
be used. Viral based systems include retroviral, 

10 lentiviral, adenoviral, adeno-associated viral, herpes 

viral, Canarypox and vaccinia-viral based systems. Non- 
viral based systems include direct administration of 
nucleic acids, microsphere encapsulation technology 
(poly (lactide-co-glycolide) and, liposome-based systems . 

15 Viral and non- viral delivery systems may be combined 

where it is desirable to provide booster injections after 
an initial vaccination, for example an initial "prime" 
DNA vaccination using a non-viral vector such as a 
plasmid followed by one or more "boost" vaccinations 

20 using a viral vector or non- viral based system. 

Similarly the invention contemplates prime boot systems 
with the polynucleotide of the invention, followed by 
boosting with protein in adjuvant or vice versa. 

25 A nucleic acid sequence of the present invention may also 
be administered by means of transformed cells. Such 
cells include cells harvested from a subject. The naked 
polynucleotide or vector of the present invention can be 
introduced into such cells in vitro and the transformed 

30 cells can later be returned to the subject. The 

polynucleotide of the invention may integrate into 
nucleic acid already present in a cell by homologous 
recombination events- A transformed cell may, if 
desired, be grown up in vitjro and one or more of the 

35 resultant cells may be used in the present invention . 
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Cell be provided at an appropriate site m a patient 

by known surgical or microsurgical techniques (e.g. 
grafting, micro-injection, etc.) 

5 The pharmaceutical compositions of the present invention 
may include adjuvant compounds, as detailed above, or 
other substances which may serve to increase the immune 
response induced by the protein which is encoded by the 
DNA. These may be encoded by the DNA, either separately 

10 from or as a fusion with the antigen, or may be included 
as non-DNA elements of the formulation. Examples of 
adjuvant -type substances which may be included in the 
formulations of the present invention include ubiquitin, 
lysosomal associated membrane protein (LAMP) , hepatitis B 

15 virus core antigen, FLT3-ligand (a cytokine important in 
the generation of professional antigen presenting cells, 
particularly dentritic cells) and other cytokines such as 

IFN-y and GMCSF. Other preferred adjuvants include 
Imiquimod and Resimquimod and Tucarasol . Imiquimod being 
20 particularly preferred . 

The present invention in a preferred embodiments of the 
invention provides the use of a nucleic acid molecule as 
herein described for the treatment or prophylaxis of HIV 
25 infection. The nucleic acid molecule is preferably 

administered with Imiquimod. The Imiquimod is preferably 
administered topically, whereas the nucleic acid molecule 
is preferably administered by means of the particle 
mediated delivery. 

30 

Accordingly the present invention provides a method of 
treating a subject suffering from or susceptible to HIV 
infection, comprising administering a nucleic acid 
molecule as herein described and Imiquimod. 

35 
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The present invention will now be described by reference 
to the following examples: 

EXAMPLES 

5 

Example 1: Optimisation of p55 gag (pl7, p24, pl3} to 

resemble codon usage of highly expressed human genes. 



Gene of interest 

10 

A synthetic gene coding for the p55gag antigen of the 
HIV-1 clade B strain HXB2 (GenBank entry K03455) , 
optimised for expression in mammalian cells was assembled 
from overlapping oligonucleotides by PGR. 

15 

Optimisation involved changing the codon usage pattern of 
the viral gene to give a codon frequency closer to that 
found in highly expressed human genes. Codons were 
assigned using a statistical Visual Basic program called 
20 Syngene (an updated version of Calcgene, written by R.S. 
Hale and G. Thompson, Protein Expression and Purification 
Vol. 12 pp 185-188, 1998) 

Cloning : 

25 

The 1528bp gag PGR product was gel purified, cut with 
restriction endonucleases NotI and Bam HI and ligated 
into Notl/BamHI cut vector WRG7077. This places the gene 
between the CMV promoter/ intron A and the Bovine growth 
30 hormone polyadenylation signal . 

Clones were sequenced and checked for errors. No single 
clone was 100% correct. Regions of correct sequence from 
two clones were therefore combined by overlapping PGR 
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using appropriate combinations of the optimisation oligo 
set to give a full length codon optimised gag gene. This 
final clone was subsequently found to contain a single 
nucleotide deletion which resulted in a frame shift and 
5 premature termination of translation. The deletion was 

repaired by cutting out the region of the gene containing 
the incorrect sequence and cloning in the correct 
sequence from the equivalent region of another clone. 
This gave the final codon optimised p55 gag clone: 
10 Gagoptrpr2 - (See figure 2) 

Example 2: Production of a pl7/p24 truncated Nef 

fusion gene 



IS Gene of Interest 

The pi 7 and p24 portions of the p55gag gene derived from 
the HIV-1 clade B strain HXB2 was PGR amplified from the 
plasmid pHXB?Pr (B .Maschera, E Furfine and E.D. Blair 
20 1995 J.Virol 69 5431-5436) . pHXB?Pr. 426bp from the 3' 
end of the HXB2 nef gene were amplified from the same 
plasmid. Since the HXB2 nef gene contains a premature 
termination codon two overlapping PCRs were used to 
repair the codon (TGA [stop] to TGG [Trp] ) 

25 

The pi 7 /p2 4 linker and trNEFl inker PGR products were 
joined to form the pl7p24trNEF fusion gene (figure 3) in 
a PGR reaction (antisense) 

30 The 1542bp product was gel purified, cut with restriction 
endonucleases Notl and BamHI and cloned into the NotI 
BamHI sites of vector WRG7077- This places the gene 
between the CMV promoter/ intron A and the Bovine growth 
hormone polyadenylation signal . 
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Example 3 : Production of an Gag pl7/24opt/trNef 1 

( ^Gagopt/Nef ) fusxon gene. 

5 

Gene of interest 

The pl7/p24 portion of the codon optimised p55gag gene 
derived from the HIV-1 clade B strain HXB2 was PGR 
amplified from the plasmid pGagOPTrpr2 . The truncated 
HXB2 Nef gene with the premature termination codon 
repaired (TGA [stop] to TGG [Trp] ) was amplified by PGR 
from the plasmid 7077trNef20. The two PGR products were 
designed to have overlapping ends so that the two genes 
could be joined in a second PGR. 

The 1544bp product was gel purified, cut with restriction 
endonucleases NotI and BamHI and cloned (see figures) 
into the NotI BamHI sites of vector WRG7077. This places 
20 the gene between the GMV promoter/ intron A and the Bovine 
growth hormone polyadenylation signal . 

Example 4: Plasmid: p7077-RT3 Clone #A 

25 Gene of Interest: 

A synthetic gene coding for the RT portion of the pol 
gene of HIV-1 clade B strain HXB2, optimised for 
expression in mammalian cells assembled from overlapping 
30 oligonucleotides by PGR. The sequence cloned is 

equivalent to positions 2550-4222 of the HXB2 reference 
sequence (GenBank entry K03455) . To ensure expression the 
cloned sequence has two additional codons at the S'' end 
not present in the original gene - AUG GGC (Met Gly) . 

-34- 



10 



15 



wo 03/025003 



PCT/EP02/10592 



Optimisation involved changing the codon usage pattern of 
the viral gene to give a codon frequency closer to that 
found in highly expressed human genes, but excluding 
rarely used codons . Codons were assigned using a 
5 statistical Visual Basic program called SyngQiiG (an 

updated version of Calcgene, written by R.S. Hale and G. 
Thompson, Protein Expression and Purification Vol. 12 pp 
185-188, 1998) 

10 The final clone was constructed from two intermediate 
clones, # 16 and #21. 

Cloning : 

15 The 1 . 7kb PGR products were gel purified, cut with NotI 

and BaraHI and PGR cleaned, before being ligated with NotI 
/ BamHI cut pWRG7077. This places the gene between the 
CMV promoter and bovine growth hormone polyadenylation 
signal. Glones were sequenced. No clone was 100% correct, 

20 but clone #16 was corrected by replacing the 4 03bp Kpnl- 
BamHI fragment containing 3 errors with a correct Kpnl- 
BamHI fragment from clone#21. The final clone was 
verified by sequencing, (see figure 5) 

25 

Example 5 : Optimised RT 

Gene of interest 

The synthetic gene coding for the RT portion of the pol 
30 gene of HIV-1 clade B strain HXB2 , optimised for 

expression in mammalian cells was excised from plasmid 
p7077-RT3 as a 1697bp NotI / BamHI fragment, gel 
purified, and cloned into the NotI & BamHI sites of 
p7313-ie (derived from pspC31) to place the gene 
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downstream of an Iowa length HCMV promoter + exonl, and 
upstream of a rabbit globin poly-adenylation signal. 
(R7004 p27) (figure 6) 



5 Example 6 

Plasmid; 7 077trNef20 
Gene of interest 

The insert comprises part of the Nef gene from the HIV-1 
clade B strain HXB2 . 195bp are deleted from the 5' end of 

10 the gene removing the codons for the first 65 amino acids 
of Nef. In addition the premature termination codon in 
the published HXB2 nef sequence has been repaired (TAG to 
TGG [Trp] ) as has been described for plasmid 
pl7/24trNEFl . The truncated nef sequence was PGR 

15 amplified from the plasmid pl7/24trNef 1 - The sequence 
cloned is equivalent to positions 8992-9417 of the HXB2 
reference sequence (GenBank entry K03455) . To ensure 
expression the cloned sequence has an additional codon at 
the 5' end not present in the original gene - AUG (Met) . 

20 

Primers : 

S t rNe f ( sense ) ATAAGAATGCGGCCGCCATGGTGGGTTTTCCAGTCACACCTT 
[SEQ ID NO: 1] 
AS t rNe f ( ant i s ens e ) 
25 CGCGGATCCTCAGCAGTTCTTGAAGTACTCC [SEQ ID NO : 2] 

PGR: 94° C 2min, then 25 cycles: 94' C 30sec, 50° C 

3 0sec, 72° C 2min, ending 72° C Smin 

30 Cloning : 

The 455bp RT PGR product was gel purified, cut with 
restriction endonucleases NotI and Bam HI and ligated 
into Notl/BamHI cut vector WRG7077. This places the gene 
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between the CMV promoter/ intron A and the Bovine growth 
hormone polyadenylation signal. 

5 Example 7 

Plasmid: 7077RT 8 
Gene of interest 

The RT portion of the pol gene was derived from the HIV~1 
clade B strain HXB2 . It was PGR amplified from the 
10 plasmid p7077Poll4. 

The sequence cloned is equivalent to positions 2550-4234 
of the HXB2 reference sequence (GenBank entry K03455) . To 
ensure expression the cloned sequence has two additional 
15 codons at the 5' end not present in the original gene - 
AUG GGC (Met Gly) . 

Primers : 

SRT (sense) ATAAGAATGCGGCCGCCATGGGCCCCATTAGCCCTATTGAGACT 
20 [SEQ ID NO: 3] 

ASRT ( ant i sense) 

CGCGGATCCTTAATCTAAAAATAGTACTTTCCTGATT [SEQ ID NO: 4] 

PGR: 94° C 2min, then 25 cycles: 94° C SOsec, 50" C 

25 3 0sec, 72' C 4min, ending 72° C 5min 

Cloning : 

The 172 Obp RT PGR product was gel purified, cut with 
restriction endonucleases NotI and Bam HI and ligated 
30 into Notl/BamHI cut vector WRG7077. This places the gene 
between the CMV promoter/ intron A and the Bovine growth 
hormone polyadenylation signal. 
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Example 8 

pl7/24opt/RT/trNef 13 ( ^Gagopt/RT/Nef ' ) 

This construct contains a PGR that causes an R to H amino 
5 acid change. 

Gene of Interest: 

The pl7/p24 portion of the codon optimised p55gag gene 
derived from the HIV-1 clade B strain HXB2 was PGR 
amplified from the plasmid pGagOPTrpr2 . The RT coding 

10 sequence was PGR amplified from the plasmid 7077RT 8. The 
truncated HXB2 Nef gene with the premature termination 
codon repaired (TGA [stop] to TGG [Trp] ) was amplified by 
PGR from the plasmid 7077trNef20. The three PGR products 
were designed to have overlapping ends so that the three 

15 genes could be joined in a second PGR. 

Primers : 

(P17/24) 

20 Spl7p24opt (sense) 

ATAAGAATGGGGCCGCGATGGGTGCGGGAGGTTCGGT [SEQ ID NO: 5] 
ASpl7p24optRTlinker (antisense) 

TGGGGGGGATGAAGAGTGTGGGTTTGTGTG [SEQ ID NO: 6] 

25 PGR: 94' C Imin, then 20 cycles: 94 C 30sec, 50° G 

30sec, 72° G 2min, ending 72° C 4min 

The 1114bp pl7/24opt product was gel purified. 

30 (RT) 

Spl7p24optRTl inker (sense) 

GAGAGTGTTGATGGGGGGGATTAGGGGTAT [SEQ ID NO : 7] 
ASRTtrNef linker (antisense) 
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AACCCACCATATCTAAAAATAGTACTTTCC [SEQ ID NO : 8] 

PGR: as above 

5 The 1711bp RT PGR product was gel purified 

(5' truncated nef) 

SRTtrNef linker (sense) 
10 GTATTTTTAGATATGGTGGGTTTTCGAGTCAG [SEQ ID NO: 9] 
AStrNef (antisense) 

CGCGGATGCTCAGCAGTTCTTGAAGTACTCC [SEQ ID NO : 10] 

PGR as above - 
15 The 44 8bp product was gel purified. 

The three PGR products were then stitched together in a 
second PGR with primers Spl7/24opt and AstrNef . 

20 PGR: 94° C Imin^ then 30 cycles: 94" C 30sec, 50° C 

30sec, 72" G 4min^ ending 72" G 4min 

The 3253bp product was gel purified, cut with restriction 
endonucleases Not I and BamHI and cloned into the Not I 
25 BamHI sites of vector WRG7077. This places the gene 

between the GMV promoter/intron A and the Bovine growth 
hormone polyadenylation signal . 

Example 9 

30 Plasmid: pGRN#16 (pl7/p24opt corr /RT/trNef.) 
Gene of interest: 

The polyprotein generated by pl7/24opt/RT/trNef 13 

( ^Gagopt/RT/Nef ) was observed to express a truncated 

35 product of -3 0kDa due to a cluster of unfavourable codons 
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within p24 around aminoacid 270. These were replaced with 
optimal codons by PGR stitching mutagenesis. 
pl7/24opt/RT/trNef 13 was used as a template to amplify 
the portion of Gag 5 ' to the mutation with primers 
5 Spl7/p24opt and GTR-A, and the portion of Gag 3 ' to the 
mutation with primers GTR-S and Aspl 7/p24optRTl inker . The 
overlap of the products contained the codon changes, and 
the gel purified products were stitched together using 
the Spl7/p24opt and Aspl7/p24optRTlinker primers. The 
10 product was cut with NotI and Agel and inserted into 
similarly cut pl7/24opt/RT/trNef 13 , to generate pGRN. 
Clone #16 was verified and progressed. 

Primers : 

15 

5 ' PGR : 

Spl7p24opt (sense) 

ATAAGAATGGGGGCGGGATGGGTGCGGGAGGTTGGGT [SEQ ID NO : 11] 

20 

GTR-A (Antisense) 

GGGGAGGATGTTGTTGAGGCCCAGGATGATCGACGGTTTATAGATTTCTGG [SEQ 
ID NO: 12] 

25 3 ' PGR 

Sense : GTR-S (Sense) 

ATCGTGGGGCTGAACAAGATCGTGGGCATGTACTCTCCGAGATGCATCC [ SEQ ID 
NO: 13] 

30 ASpl7p24optRTlinker (antisense) 

TGGGGCCCATCAACACTCTGGCTTTGTGTC [SEQ ID NO: 14] 

PGR conditions for individual products and stitch, using 
PWO DNA polymerase (Roche) : 

35 
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95°C Imin, then 20 cycles 95°C 30s, 55°C 30s, 72°C 180s, 
ending 72 ""C 120s and 4°C hold. 

The 1114bp product was gel purified and cut with NotI and 
5 Agel to release a 664 7bp fragment which was gel purified 
and ligated into NotI / Agel cut gel purified 
pl7/24opt/RT/trNef 13 to generate pGRN#16. 



Example 10: 

10 Plasmid: p73i-GRN2 Clone #19 

{pl7/p24 (opt) /RT (opt) trNef ) - repaired 



Gene of interest: 

The pl7/p24 portion of the codon optimised gag, codon 
15 optimised RT and truncated Nef gene from the HIV-1 clade 
B strain HXB2 downstream of an Iowa length HCMV promoter 
+ exonl, and upstream of a rabbit (3-globin poly- 
adenylation signal . 



20 Plasmids containing the trNef gene derived from plasmid 
pl7/24trNefl contain a PGR error that gives an R to H 
amino acid change 19 amino acids from the end of nef. 
This was corrected by PGR mutagenesis, the corrected nef 
PGR stitched to codon optimised RT from p7077-RT3, and 

25 the stitched fragment cut with Apal and BamHI, and cloned 
into Apal /BamHI cut p73i-GRN. 



Primers : 

PGR coRT from p7077-RT3 using primers: 
30 (Polymerase = PWO (Roche) throughout. 



Sense : Ul 

GAATTCGCGGCCGCGATGGGCCCCATCAGTCCCATCGAGAGCGTGCCGGTGAAGCTG 
AAACCCGGGAT [SEQ ID NO: 15] 
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AScoRT-Nef 

GGTGTGACTGGAAAACCCACCATCAGCACCTTTCTAA.TCCCGGC [SEQ ID NO: 
16] 

5 Cycle: 95°C(30s) then 20 cycles 95°C(30s), 55°C(30s), 
72°C(180s), then 72°C(120s) and hold at 4°C 
The 1.7kb PGR product was gel purified. 

PGR 5'' Nef from pl7/24trNefl using primers: 

10 

Sense: S-Nef 

ATGGTGGGTTTTCCAGTCACACC [SEQ ID NO : 17] 
Antisense : ASNef -G: 

GATGAAATGCTAGGCGGCTGTCAAACCTC [SEQ ID NO : 18] 
15 Cycle: 95°C(30s) then 15 cycles gS'^CCBOs), 55^C(30s), 
72°C(60s), then 72°C(120s) and hold at 4°C 

PGR 3' Nef from pl7/24trNefl using primers: 

20 Sense: SNEF-G 

GAGGTTTGACAGCCGCCTAGCATTTCATC [SEQ ID NO : 19] 

Antisense : 

AStrNef (antisense) 

CGCGGATCCTCAGCAGTTCTTGAAGTACTCC [SEQ ID NO : 2 0] 
25 Cycle: gS^'COOs) then 15 cycles 95°C(30s), 55°C(30s), 
72°C{60s), then 72''C(120s) and hold at 4^0 

The PGR products were gel purified. Initially the two Nef 
products were stitched using the 5' (S~Nef) and 3'' 
30 (AstrNef) primers. 

Cycle: 95''C(30s) then 15 cycles 95°C(30s)^ 55°C(30s), 
72°C(60s), then 72°C(180s) and hold at 4°C . 
The PGR product was PGR cleaned, and stitched to the RT 
product using the Ul and AstrNef primers; 



-42 



wo 03/025003 



PCT/EP02/10592 



Cycle: 95°C(30s) then 20 cycles 95°C(30s), 55°C(30s), 
72''C(180s), then 72°C(180s) and hold at 4°C 

The 2 . Ikb product was gel purified, and cut with Apal and 
5 BamHI. The plasmid p73I-GRN was also cut with Apal and 

BamHI gel purified and ligated with the Apal -Bam RT3trNef 
to regenerate the pl7/p24 (opt) /RT (opt ) trNef gene. 

Example 11 

10 p73i-GN2 Clone #2 (pl7/p24opt/ trNef ) - repaired 
Gene of interest: 

The pl7/p24 portion of the codon optimised gag and 
truncated Nef genes from the HIV-1 clade B strain HXB2 
15 downstream of an Iowa length HCMV promoter + exonl, and 
upstream of a rabbit [B-globin poly- adenylat ion signal. 

Plasmids containing the trNef gene derived from plasmid 
pl7/24trNefl contain a PGR error that gives an R to H 

20 amino acid change 19 amino acids from the end of Nef. 

This was corrected by PGR mutagenesis and the corrected 
fragment cut with Bglll and BamHI, and cloned into 
Bglll/BamHI cut p73I-GN. (Figure 12) regenerate the 
corrected pl7/p24opt /trNef fusion gene downstream of the 

25 Iowa length HCMV promoter + exonl, and upstream of the 
rabbit p-globin polyadenylation signal - 

PGR 5' Nef from pl7/24trNefl using primers s 

Polymerase = PWO (Roche) throughout. 

30 

Sense: S-Nef 

ATGGTGGGTTTTCCAGTCACACC [SEQ ID NO : 21] 
Antisense : ASNef -G : 

GATGAAATGCTAGGCGGCTGTCAAACCTC [SEQ ID NO : 22] 
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Cycle: 95°C(30s) then 15 cycles 95°C(30s), 55°C(30s), 
72°C(60s), then 72°C(120s) and hold at 4°C 



PGR 3' Nef from pl7/24trNefl using primers: 

5 Sense: SNBF-G 

GAGGTTTGACAGCCGCCTAGCATTTCATC [SEQ ID NO : 23] 
Ant i s ens e ; A S t rNe f 

CGCGGATCCTCAGCAGTTCTTGAAGTACTCC [SEQ ID NO : 24] 

10 Cycle: 95°C(30s) then 15 cycles 95°C(30s), 55°C(30s), 
72''C(60s), then 72°C{120s) and hold at 4°C 



The PGR products were gel purified, and stitched using 
the 5' (S-Nef) and 3' (AstrNef) primers. 
15 Cycle: 95°C(30s) then 15 cycles 95°C(30s), 55°C(30s), 
72''C{60s), then 72°C(180s) and hold at 4°C . 
The PGR product was PGR cleaned, cut with Bglll / BamHI , 
and the 3 67bp fragment gel purified and cloned into 
Bglll/BamHI cut gel purified p73i-GN. 

20 

Example 12 

Plasmid: p73I-RT w229k (Inactivated RT) 



Gene o£ Interest: 

25 Generation of an inactivated RT gene downstream of an 
Iowa length HCMV promoter + exon 1, and upstream of a 

rabbit p-globin poly-adenylation signal . 

Due to concerns over the use of an active HIV RT species 
30 in a therapeutic vaccine inactivation of the gene was 

desirable. This was achieved by PGR mutagenesis of the 
RT (derived from P73I-GRN2) amino acid position 22 9 from 
Trp to Lys (R7271 pl-28) . 
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Primers : 

PGR 5'' RT + mutation using primers: 
(polymerase = PWO (Roche) throughout) 

5 Sense : RT3-u:l 

GAATTCGCGGCCGCGATGGGCCCCATCAGTCCCATCGAGACGGTGCCGGTGAAGCTG 
AAACCCGGGAT [SEQ ID NO: 25] 

Antisense : AScoRT-Trp22 9Lys 
10 GGAGCTCGTAGCCCATCTTCAGGAATGGCGGCTCCTTCT [SEQ ID NO: 2 6] 

Cycle : 

1 X [94°C (30s) ] 

15 X [94''C (30s)/55°C (30s)/72°C (60s)] 
15 1 X [72°C (180s) ] 
PGR gel purify 

PGR 3' RT + mutation using primers: 

20 Antiense: RT3- 1:1 

GAATTCGGATCCTTACAGCAGCTTTCTAATCCCCGCACTCACCAGCTTGTCGACCTG 
CTCGTTGCCGC [SEQ ID NO: 27] 

Sense : ScoRT-Trp22 9Lys 
25 CCTGAAGATGGGCTACGAGCTCGATG [SEQ ID NO : 28] 

Cycle : 

1 X [94''C (30s) ] 

15 X [94°C (30s)/55°C (30s) /72^C (60s)] 
30 IX [72°C (180s) ] 
PGR gel purify 

The PGR products were gel purified and the 5*^ and 3*^ ends 
of RT were stitched using the 5' (RT3-U1) and 3'' (RT3-L1) 
35 primers . 
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Cycle : 

1 X [94°C (30s) ] 

15 X [94''C (30s)/55°C (30s)/72°C (120s)] 
1 X [72°C (180s) ] 

5 

The PGR product was gel purified, and cloned into 
p7313ie, utilising NotI and BamHI restriction sites, to 
generate p73I-RT w229k. (See figure 13) 

10 

Example 13 : 

Plasmid: pVBi-Tgrn (#3) 

Gene of interest: 

15 The pl7/p24 portion of the codon optimised gag, codon 

optimised RT and truncated Nef gene from the HIV-1 clade 
B strain HXB2 downstream of an Iowa length HCMV promoter 
+ exonl^ and upstream of a rabbit (3-globin poly- 
adenylation signal . 

20 

Triple fusion constructs which contain an active form of 
RT, may not be acceptable to regulatory authorities for 
human use thus inactivation of RT was achieved by 
Insertion of a Nhel and Apal cut fragment from p73i-RT 
25 w229k, into Nhel/Apal cut p73i-GRN2#19 (Figure 14) . This 
results in a W K change at position 229 in RT, 

Example 14 
30 p73l-Tnrg (#16) 

Gene of interest: 

The truncated Nef , inactivated codon optimised RT and 
pl7/p24 portion of the codon optimised gag gene from the 
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HIV-1 clade B strain HXB2 downstream of an Iowa length 
HCMV promoter + exonl, and upstream of a rabbit (3-globin 
poly- adenylat ion signal . 

5 The order of the genes in the polyprotein encoded by 
p73i-Tgm were rearranged by PGR and PGR stitching to 
generate p73I-Tnrg (Figure 15) . Each gene was PGR 
amplified and gel purified prior to PGR stitching of the 
genes to form a single polyprotein. The product was gel 
10 purified, Notl/BamHI digested and ligated into Notl/BamHI 
cut p7313ie. 

Primers : 

trNGf PCR 

15 

S-Nef (Not I) 

CATTAGAGGGGCGGCGATGGTGGGTTTTCGAG [SEQ ID NO : 2 9] 

AS-Nef-coRT linker 
20 GATGGGAGTGATGGGGCCCATGCAGTTCTTGAACTACTGCGG [SEQ ID NO: 
30] 

RTw229k PCR 

25 S-coRT 

ATGGGCCCCATGAGTCCCATGGAG [SEQ ID NO: 31] 

AS-coRT-pl7p24 linker 

CAGTACCGAAGGTCGGGCACCCATGAGCACCTTTCTAATCCCCGC [SEQ ID NO: 
30 32] 

pl7p24opt PCR 

S-pl7p24opt 

35 ATGGGTGCCCGAGCTTCGGTACTG [SEQ ID NO: 33] 
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AS-pl7p24opt (BamHI) 

GATGGGGGATCCTCACAACACTCTGGCTTTGTGTCC [SEQ ID NO : 34] 

5 PGR conditions for individual products and stitching 
using VENT DNA polymerase (NEB) ; 

1 X [94°C (30s) ] 

25 X [94''C (30s)/55°C (30s)/72°C (120s [pl7p24 or RT] or 
10 60s [trNef])] 

1 X [72''C (240s) ] 

The PGR products were gel purified and used in a PGR 
stitching utilising the primers S-trNef (I^Totl) and AS- 
15 pl7p24opt (BainHI) : 

1 X [94''C (30s) ] 

25 X [94°C (30s)/55°C (30s)/72°C (210s)] 
1 X [72°C (240s) ] 

20 

The 3 0 00bp product was gel purified and cut with Not I and 
BamHI which was PGR cleaned and ligated into Notl/BamHI 
digested gel purified p7313ie to generate p73i-Tnrg. 

25 Example 15 : 

1. Plasmid: P73i-Tngr (#3) 

Gene of Interest: 

The truncated Nef , pl7/p24 portion of the codon 
30 optimised gag and inactivated codon optimised RT gene 

from the HIV~1 clade B strain HXB2 downstream of an Iowa 
length HCMV promoter + exonl^ and upstream of a rabbit 
globin poly-adenylation signal - 
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The order of the genes in the polyprotein encoded by 
p73i-Tgrn were rearranged by PGR to generate p73I-Tngr 
(Figure 16) . Codon optimised pl7/p24 and RT were 
generated as a single product, and PGR stitched to 
5 amplified trNef . The product was gel purified, 

Notl/BamHI digested and ligated into Notl/BamHI cut 
p7313ie. 

Primers : 

10 P17/p24 - RT 3^ PGR : 

Spl7p24opt (sense) 

ATGGGTGCGCGAGCTTCGGTACTG [SEQ ID NO: 35] 

15 RT3 1:1 (antisense) 

GAATTCGGATCCTTAGAGCACCTTTCTAATCCCCGCACTCACCAGCTTGTCGACCTG 
CTCGTTGCCGC [SEQ ID NO: 36] 

TrNef 5^ PGR 

20 

S-Nef (Not I) 

CATTAGAGCGGCCGGGATGGTGGGTTTTCCAC [SEQ ID NO : 3 7] 
AS-Nef -pl7p24 

25 CAGTACCGAAGCTCGGGCACCCATGCAGTTCTTGAACTACTCCGG [SEQ ID NO: 
38] 

PGR conditions for individual products and stitching 
using VENT DNA polymerase (NEB) : 

30 

1 X [94''C (30s) ] 

25 X [94°C (30s)/55°C (30s)/72^C (180s [pl7p24+RT] or 60s 
[trNef] or 210s [stitching])] 
1 X [72°C (240s) ] 

35 
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The 3000bp product was gel purified and cut with NotI and 
BamHI which was PGR cleaned and ligated into Notl/BamHI 
digested gel purified p7313ie to generate p73i-Tngr. 

S Example 16: 

Plasmid: p73I-Trgn (#6) 

Gene of Interest: 

The inactivated codon optimised RT, pl7/p24 portion of 
10 the codon optimised gag and truncated Nef gene from the 
HIV-1 clade B strain HXB2 downstream of an Iowa length 
HCMV promoter + exonl, and upstream of a rabbit (3-globin 
poly-adenylation signal . 

15 The order of the genes within the construct was achieved 
by PGR amplification of pl7p24-trNef and RTw229k from the 
plasmids p73I-GN2 and p73I-RTw229k respectively, PGR 
stitching was performed and the product gel purified and 
Notl/BamHI cut prior to ligation with Notl/BamHI digested 

20 p7313ie. Sequencing revealed that pl7p24 was not fully 
optimised a 7 0 0bp fragment was then Age I /Muni cut from 
the coding region and replaced with Muni/Age fragment 
from p73i-Tgrn#3 containing the correct coding sequence. 
(See figure 17) . 

25 

Primers : 

pl7p24-trNef PGR 
S-pl7p24opt 

30 ATGGGTGCCCGAGCTTCGGTAGTG [SEQ ID NO: 3 9] 
AstrNef (BamHI) 
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RTw229k 
RT3-U: 1 

GAATTCGCGGCCGCGATGGGCCCCATCAGTCCCATCGAGACCGTGCCGGTGAAGCTG 
5 AAACCCGGGAT [SEQ ID NO: 40] 

AS-coRT-pl7p24opt linker 

CAGTACCGAAGCTCGGGCACCCATCAGCACCTTTCTAATCCCCGC [SEQ ID NO : 
41] 

10 

PGR conditions for individual products and stitching 
using VENT DNA polymerase (NEB) : 

1 X [94''C (30s) ] 
15 25 X [94°C (30s)/55''C (30s)/72°C (120s (PGR) or 180s 
(stitching) 
1 X [72°G (240s) ] 

The 3 0 00bp product from the PGR stitch was gel purified 
20 and cut with Not I and BamHI which was PGR cleaned and 

ligated into Notl/BamHI digested gel purified p7313ie to 
generate p73i-Tngr. Sequence analysis showed that pl7p24 
sequence obtained from p73I-GN2 was not fully codon 
optimised and that this had been carried over into the 
25 new plasmid. This was rectified by cutting a 700bp 
fragment from p73i-Tngr cut with Muni and Agel, and 
replacing it by ligation with a 7 00bp Munl/Agel digested 
product from p73i--Tgrn to generate the construct p731- 
Tngr#6 . 

30 

Example 17 : 

Plasmid: p73i-Trng (#11) 
Gene of Interest: 
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The inactivated codon optimised RT, truncated Nef and 
pl7/p24 portion of the codon optimised gag gene from the 
HIV-1 clade B strain HXB2 downstream of an Iowa length 
HCMV promoter + exonl, and upstream of a rabbit p-globin 
5 poly-adenylation signal . 

The order of the genes within the construct was achieved 
by PGR amplification of the RT~trNef and pl7p24 genes 
from p73i-Tgrn. PGR stitching of the two DNA fragments 
10 was performed and the 3kb product gel purified and 

Notl/BamHI cut prior to ligation with Notl/BamHI digested 
p7313ie, and yielded p73I Trng (#11) . 

Primers : 

15 RTw22 9k-trNef 

RT3-U: 1 

GAATTCGCGGCCGCGATGGGCGGGATCAGTCGGATGGAGAGGGTGGCGGTGAAGCTG 
AAACGCGGGAT [SEQ ID NO : 42] 

20 

AS-Nef -pl7p24opt linker 

CAGTACCGAAGGTCGGGCACCGATGGAGTTCTTGAAGTAGTGGGG [SEQ ID NO: 
43] 

25 

P17p24 
S-pl7p24opt 

ATGGGTGGGGGAGGTTCGGTACTG [SEQ ID NO: 44] 

30 

AS-pl7p24opt (BamHI) 

GATGGGGGATGGTGACAAGACTCTGGGTTTGTGTGG [SEQ ID NO : 45] 

PGR conditions for individual products and stitching 
35 using VENT DNA polymerase (NEB) : 
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1 X [94°C (30s) ] 

25 X [94°C (30s)/55°C {30s) /72°C (120s (PGR of genes) or 

180s (stitching) 
5 1 X [72°C (240s) ] 

The 3 000bp product from the PGR stitch was gel purified 
and cut with NotI and BamHI which was PGR cleaned and 
ligated into Notl/BamHI digested gel purified p7313ie to 
10 generate p73i-Tngr. 

Example 18: 
p73i-Tgnr {#fl) 

15 Gene of interest: 

The pl7/p24 portion of the codon optimised gag, truncated 
Nef and codon optimised inactivated RT gene from the 
HIV-1 clade B strain HXB2 downstream of an Iowa length 
HCMV promoter + exonl, and upstream of a rabbit (3-globin 
20 poly-adenylation signal . 

The order of the genes within the construct was achieved 
by PGR amplification of pl7p24-trNef and RTw22 9k from the 
plasmids p73I-GN2 and p73I-RTw22 9k respectively. PGR 

25 stitching was performed and the product gel purified and 
Notl/BamHI cut prior to ligation with Notl/BamHI digested 
p7313ie. Two sequence errors were spotted in the sequence 
(pl7p24 and RT) which were subsequently repaired by 
replacement with correct portions of the genes utilising 

30 restriction sites within the polyprotein. (See figure 
19) . 

Primers : 
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pl7p24-trNef PGR 
S-pl7p24opt 

ATGGGTGCCCGAGCTTCGGTACTG [SEQ ID NO: 46] 

5 

AS -Nef - coRTl inker 

GATGGGACTGATGGGGCCCATGCAGTTCTTGAACTACTCCGG [SEQ ID NO: 
47] 

10 RTw229k 
S - coRT 

ATGGGCCCCATCAGTCCCATCGAG [SEQ ID NO : 48] 
15 RT3-1:1 

GAATTCGGATCCTTACAGCACCTTTCTAATCCCCGCACTCACCAGCTTGTCGACCTG 
CTCGTTGCCGC [SEQ ID NO: 49] 

PGR conditions for individual products and stitching 
20 using VENT DNA polymerase (NEB) : 

1 X [94°C (30s)] 

25 X [94°C (30s)/55''C (30s) /72°C (120s (PGR) or 180s 
(stitching) 
25 1 X [72''C (240s) ] 

The 3000bp product was gel purified and cut with NotI and 
BattiHI which was PGR cleaned and ligated into Notl/BamHI 
digested gel purified p7313ie to generate p73i-Tngr. 

30 Sequencing revealed that pl7p24 was not fully optimised a 
700bp fragment was subsequently Age I /Muni cut from the 
coding region and replaced with Muni /Age fragment from 
p73i-Tgrn#3 containing the correct coding sequence. The 
polyprotein also contained a single point mutation 

35 (G2609A) resulting in an amino acid substitution of Thr 
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to Ala in the RT portion of the polyprotein. The 
mutation was corrected by Apal/BamHI digestion of the 
construct and PGR clean up to remove the mutated 
sequence, which was replaced by ligation with an 
Apal/BamHI digested portion of RT from p73i-Tgnr. 



Example 19 : 

Preparation of plasmid-coated ^gold slurry' for ^gene 
gun' DNA car 'bridges 

Plasmid DNA (approximately Ipg/yil) , eg. 100 ug, and 2iam 
gold particles, eg. 50 mg, (PowderJect) , were suspended 
in 0.05M spermidine, eg. 100 ul, (Sigma). The DNA was 
precipitated on to the gold particles by addition of IM 
CaCla, eg. lOOul (American Pharmaceutical Partners, Inc., 
USA) . The DNA/gold complex was incubated for 10 minutes 
at room temperature, washed 3 times in absolute ethanol, 
eg. 3 X 1 ml, (previously dried on molecular sieve 3A 
(BDH) ) . Samples were resuspended in absolute ethanol 
containing 0.05mg/ml of polyvinylpyrrolidone ( PVP, 
Sigma), and split into three equal aliquots in 1.5 ml 
microfuge tubes, (Eppendorf ) . The aliquots were for 
analysis of (a) ^gold slurry', (b) eluate- plasmid 
eluted from (a) and (c) for preparation of gold/ plasmid 
coated Tefzel cartridges for the ^gene gun', (see Example 
3 below) . For preparation of samples (a) and (b) , the 
tubes containing plasmid DNA / ^gold slurry' in ethanol / 
PVP were spun for 2 minutes at top speed in an Eppendorf 
5418 microfuge, the supernatant was removed and the 'gold 
slurry' dried for 10 minutes at room temperature. Sample 
(a) was resuspended to 0.5 - 1.0 ug / ul of plasmid DNA 
in TE pH 8.0, assuming approx. 50 % coating. For elution, 
sample (b) was resuspended to 0.5 - 1.0 ug / ul of 
plasmid DNA in TE pH 8 . 0 and incubated at 37°C for 30 
minutes, shaking vigorously, and then spun for 2 minutes 
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at top speed in an Eppendorf 5418 microfuge and the 
supernatant, eluate, was removed and stored at -20°C. The 
exact DNA concentration eluted was determined by 
spectrophotometric quantitation using a Genequant II 
(Pharmacia Biotech) . 

Example 20: 

Preparations of Cartridges for DNA iinniunisation 

Preparation of cartridges for the Accell gene transfer 
device was as previously described (Eisenbraun et al DNA 
and Cell Biology, 1993 Vol 12 No 9 pp 791-797; Pertner et 
al) . Briefly, plasmid DNA was coated onto 2 jam gold 
particles (DeGussa Corp., South Plainfield, N.J., USA) 
and loaded into Tefzel tubing, which was subsequently cut 
into 1.2 7 cm lengths to serve as cartridges and stored 
desiccated at 4°C until use. In a typical vaccination, 
each cartridge contained 0 . 5 mg gold coated with a total 
of 0.5 Jig DNA/ cart ridge. 

Example 21: 

Iinmune Response to HIV antigens following DNA vaccination 
utilising the gene gun. 

Mice (n=3/group) were vaccinated with antigens encoded by 
nucleic acid and located in two vectors. P7077 utilises 
the HCMV IE promoter including Intron A and exon 1 (fcmv 
promoter) . P73I delivers the same antigen, but contains 
the HCMV IE promoter (icmv promoter) that is devoid of 
Intron A, but includes exon 1 . 

Plasmid was delivered to the shaved target site of 
abdominal skin of Fl (C3H x Balb/c) mice. Mice were 
given a primary immunisation of 2 x 0 . 5 DNA on day 0, 
boosted with 2 x 0 . 5 DNA on day 35 and cellular 
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response were detected on day 4 0 using IFN - gamma 
Elispot . 



• P73I - empty vector 
5 • P7077 - empty vector 

• P70 77 GRN - (f CMV promoter) Gag, RT, Nef 

• P73I GRN - (i CMV promoter) Gag, RT, Nef 

• P73I GR3N - (CMV promoter) Optimised Gag, 
Optimised RT, Nef 

10 • P7077 GN - (f CMV promoter) Gag, Nef 

• P73I GN - (i CMV promoter) Gag, Nef 



Cytotoxic T Cell Responses 

15 The cytotoxic T cell response was assessed by CD8+ T 
cell-restricted IFN-y ELISPOT assay of splenocytes 
collected 5 days later. Mice were killed by cervical 
dislocation and spleens were collected into ice-cold PBS. 
Splenocytes were teased out into phosphate buffered 

20 saline (PBS) followed by lysis of red blood cells (1 

minute in buffer consisting of 155mM NH4CI, 10 mM KHCO3, 
0 . ImM EDTA) . After two washes in PBS to remove 
particulate matter the single cell suspension was 
aliquoted into ELISPOT plates previously coated with 

25 capture IFN-y antibody and stimulated with CDS -restricted 
cognate peptide (Gag, Nef or RT) . After overnight 
culture, IFN-y producing cells were visualised by 
application of ant i -murine IFN-y-biotin labelled antibody 
(Pharmingen) followed by streptavidin -conjugated 

30 alkaline phosphatase and quantitated using image 
analysis . 



The result of this experiment are shown in figures 20, 
21, and 22 . 
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Example 22: 

Iinnvuiiogenicity of Vaccine Constructs 

5 !• Cellular Assays 

The cellular immune response comprises cytotoxic CDS 
cells and helper CD4 cells. A sensitive method to detect 
specific CDS and CD4 cells is the ELlspot assay which can 
be used to quantify the number of cells capable of 

10 secreting interferon-y or IL-2 . The ELlspot assay relies 
on the capture of cytokines secreted from individual 
cells. Briefly, specialised microtitre plates are coated 
with anti-cytokine antibodies. Splenocytes isolated from 
immunised animals are incubated overnight in the presence 

15 of specific peptides representing known epitopes (CDS) or 
proteins (CD4) . If cells are stimulated to release 
cytokines they will bind to the antibodies on the surface 
of the plate surrounding the locality of the individual 
producing cells. Cytokines remain attached to the coating 

20 antibody after the cells have been lysed and plates 

washed- The assay is developed in a similar way to an 
ELISA assay using a biotin/avidin amplification system. 
The number of spots equates to the number of cytokine 
producing cells. 

25 

CDS responses to the following K2'^-restricted murine 
epitopes: Gag (AMQMLKETI) , Nef (MTYKAAVDL) and RT 
(YYDPSKDLI) and CD4 responses to Gag and RT proteins were 
recorded for all 6 constructs - The results of these 
30 assays were analysed statistically and constructs were 

ranked according to their immunogenicity . The result is 
shown in figure 23 of the figures. 



2 . Humoral Assays 

35 
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Blood samples were collected for antibody analysis at 7 
and 14 days post-boost from two experiments- Serum was 
separated and stored frozen until antibody titres could 
be measured using specific ELISA assays. All samples were 

5 tested for antibodies to Gag, Nef and RT. Briefly, ELISA 
plates were coated with the relevant protein. Excess 
protein was washed off before diluted serum samples were 
incubated in the wells. The serum samples were washed off 
and ant i -mouse antiserum conjugated to an appropriate tag 

10 was added. The plate was developed and read on a plate 
reader. The results are shown in figure 24. 

3. Antibody Data 

15 Antibody titres were measured for all six constructs in 
four experiments. Construct p73i-GNR consistently 
generated no antibody responses to Gag and limited 
antibody responses to Nef. The reason for this is 
unclear, as T-cell responses were observed from 

20 splenocytes isolated from the same mice, indicating that 
the Gag protein was being expressed in vivo. 

The ranking for the generation of Gag specific antibodies 
was : 

25 

RNG>GRN>NRG>RGN>NGR>GNR 

Analysis cellular immunology data 

30 The objective was to rank the 6 constructs on the basis 
of spot count data from 3 immunology experiments. Three 
sets of responses were assessed: 

CD8 responses to Gag, Nef and RT at Day 7 (7 days post 
35 primary) , 
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CD4 responses to Gag and RT at Day 35 (7 days post 
boost) , 

CDS responses to Gag, Nef and RT at Day 35 (7 days post 
boost) . 

5 

Each response (e.g. CDS response to Gag) was modeled 
using a linear mixed effect model in SAS version 8. The 
model included fixed effects of construct, whether the 
particular antigen (Gag, Nef or RT) was present or 

10 absent, and whether IL-2 was present or absent. In 

addition, for CDS responses, where data were available 
from each individual mouse, subject was included as a 
random effect in the model. The model included 
interaction terms to allow for a different effect of 

15 construct for each combination of the antigen 
(present /absent) and IL-2 (present /absent ) . 

From the model, the difference in adjusted mean response 
between each construct and p7313 (the control group) was 

20 estimated separately for each combination of antigen 

(present /absent) and IL-2 (present /absent ) , together with 
a p-value indicating whether the difference was 
statistically significant. Based on the differences and 
p-values in the presence of the antigen and the absence 

25 of IL-2, constructs were ranked, by assigning a score of 
6 to the construct with the largest difference, 5 to the 
next largest, etc, but 0 to any constructs where the 
difference was not statistically significant at the 5% 
level . 

30 

The assumptions of the model - that the residuals were 
normally distributed with constant variance, were 
assessed using graphical methods and sensitivity 
analyses, where first a log and second a square root 
35 transformation of the response was modeled. The ranking 
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of the constructs was not sensitive to departures from 
the assumptions of the model. 

Having calculated the ranks for each response in each 
5 experiment separately, total ranks for the 3 sets of 

responses were calculated across all 3 experiments. The 
following table shows the total rankings across the 3 
experiments . 



Total rankings 


of constructs for e 


ach of 3 


sets of 




r&sT^onaes, combined across 3 Izamunology experiments 


• 


Construct 


Day 7 (7 days post 


Day 3 5 


(7 days 


post 




primary) 




boost) 






CDS 


CD4 


CDS 


GRN 


5 


18 




3 


GNR 


17 


24 




28 


RGN 


28 


23 




33 


RNG 


25 


27 




37 


NRG 


25 


19 




0 


NGR 


4 


14 




10 



RNG has the highest ranking for both sets of responses at 
Day 35, and the second highest ranking behind RGN at Day 
15 7 . RGN also receives high rankings for both sets of 
responses at Day 35. 
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CLAIMS 

1. A nucleotide sequence that encodes an HIV-1 gag 
protein or fragment containing a gag epitope thereof and 
5 a second HIV antigen or a fragment encoding an epitope of 
said second HIV antigen, operably linked to a 
heterologous promoter . 

2 - A nucleotide sequence as claimed in claim 1 wherein 
10 the second antigen is selected from the group consisting 
of: Nef or a fragment thereof containing a nef epitope, 
or reverse transcriptase (RT) or a fragment thereof 
containing an RT epitope. 

15 3 . A nucleotide sequence as claimed in claim 1 or 2 
wherein the gag protein comprises pl7 . 

4 . A nucleotide sequence as claimed in claim 3 wherein 
the gag protein additionally comprises p24. 

20 

5. A nucleotide sequence as claimed in any one of 
claims 1 to 4 wherein the gag sequence is codon optimised 
to resemble the codon usage in a highly expressed human 
gene . 

25 

6. A nucleotide sequence as claimed in any one of 

claims 1-5 wherein the sequence encodes an HIV-1 gag 
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protein or fragment containing a gag epitope, a nef 
protein or a fragment containing a nef epitope and an RT 
protein or a fragment containing an RT epitope, 
preferably in the order RT, Gag, Nef or RT, Nef, Gag. 

5 

7. A nucleotide sequence as claimed in any of claim 2-6 
wherein the RT sequence or fragment thereof is codon 
optimised to resemble a highly expressed human gene. 

10 8 . A nucleotide sequence as claimed in any of claims 1 
to 7 wherein the nucleotide sequence encodes a nef 
protein or epitope thereof . 



9. A nucleotide sequence selected from the group 
15 consisting of: 

• Gag (pl7,p24) , Nef truncate 

• Gag (pl7,p24) (codon optimised), Nef (truncate) 

• Gag (pl7,p24) , RT, Nef (truncate) 

• Gag (pl7,p24) codon optimised, RT, Nef (truncate) 
20 • Gag (pl7,p24) codon optimised, RT codon optimised, 

Nef truncate 

• RT (codon optimised) , Gag (pl7, p24) codon 
optimised, Nef truncate 

• RT (codon optimised) , Nef truncate, gag pl7, p24 
25 codon optimised 
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10. A nucleotide sequence as claimed in any of claims 1 
to 9 wherein the heterologous promoter is the promoter 
from HCMV IE gene. 

5 11. A nucleotide sequence as claimed in claim 10 wherein 
the 5'' of the promoter comprises exon 1. 

12 . A vector comprising a nucleotide sequence as claimed 
in any one of claims 1 to 11. 

10 

13 . A vector as claimed in claim 12 which is a double 
stranded DNA plasmid. 

14 . A pharmaceutical composition comprising a nucleotide 
15 sequence of claim 1 to 11 or a vector of claim 12 or 13 

and a pharmaceutically acceptable excipient, diluent, 
carrier or adjuvant. 

15. A pharmaceutical composition as claimed in claim 14 
20 adapted for intra-muscular or intra-dermal delivery. 

16. A pharmaceutical composition as claimed in claim 14 
or 15 wherein the carrier is a gold bead. 

25 17. An intra-dermal delivery device comprising a 
pharmaceutical composition of claim 14, 15 or 16. 
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18. A method of treating a patient suffering from or 
susceptible to a disease comprising administration of a 
safe and effective amount of a pharmaceutical composition 
as claimed in any of claims 14 to 16 • 

5 

19. A nucleotide sequence of any of claims 1 to 11^ a 
vector of claim 12 or 13, or a composition of any of 
claims 14 to 16, for use in medicine. 

10 20. Use of a nucleotide sequence of any of claims 1 to 
11 in the manufacture of a medicament for the treatment 
of disease. 

21. A process for the production of a nucleotide as 
15 claimed in any of claims 1 to 11 comprising operably 
linking a nucleotide sequence encoding an HIV-1 gag 
protein or fragment thereof and a second HIV protein or 
fragement thereof to a heterologous promoter sequence. 
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Sequence o£ p55 gag insert in pGagOpt:rpr2 

5 ATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGAGAGCTGGACAGATGGGAGAAAATT 
AGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGTGTGGGCCTCG 
AGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGT 
CGCCAGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGG 
TCCTTGTATAACACAGTGGCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAG 

1 0 GATACCAAGGAGGCCTTGGACAAAATTGAGGAGGAGCAAAACAAGAGCAAGAAGAAG 
GCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCACAGAACTATCCT 
ATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTC 
AATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATG 
TTCTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACC 

1 5 GTGGGCGGCCATCAGGCCGCCATGCAAATGTTGAAGGAGACTATCAACGAGGAGGCA 
GCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCGCCCGGACAGATG 
CGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATC 
GGATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATC 
ATTCTCGGTCTCAATAAAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATT 

20 AGACAGGGACCCAAAGAGCCTTTTAGGGATTACGTCGACCGGTTTTATAAGACCCTG 
CGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGAGACACTCCTGGTA 
CAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACC 
CTGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGA 
GTGTTGGCCGAAGCCATGAGCCAGGTGACGAACTCCGCAACCATCATGATGCAGAGA 

25 GGGAACTTCCGCAATCAGCGGAAGATCGTGAAGTGTTTCAATTGCGGCAAGGAGGGT 
CATACCGCCCGCAACTGTCGGGCCCCTAGGAAGAAAGGGTGTTGGAAGTGCGGCAAG 
GAGGGACACCAGATGAAAGACTGTACAGAACGACAGGCCAATTTTCTTGGAAAGATT 
TGGCCGAGCTACAAGGGGAGACCTGGTAATTTCCTGCAAAGCAGGCCCGAGCCCACC 
GCCCCCCCTGAGGAATCCTTCAGGTCCGGAGTGGAGACCACAACGCCTCCCCAAAAA 

30 CAGGAACCAATCGACAAGGAGCTGTACCCTTTAACTTCTCTGCGTTCTCTCTTTGGC 
AACGACCCGTCGTCTCAATAA [SEQ ID NO : 50] 



MGARASVLSG GELDRWEKIR LRPGGKKKYK LKHIVWASRE LERFAVNPGL 
LETSEGCRQI LGQLQPSLQT GSEELRSLYN TVATLYCVHQ RIEIKDTKEA 

35 LDKIEEEQNK SKKKAQQAAA DTGHSNQVSQ NYPIVQNIQG QMVHQAISPR 
TLNAWVKWE EKAFSPEVIP MFSALSEGAT PQDLNTMLNT VGGHQAAMQM 
LKETINEEAA EWDRVHPVHA GPIAPGQMRE PRGSDIAGTT STLQEQIGWM 
TNNPPIPVGE lYKRWIILGL NKIVRMYSPT SILDIRQGPK EPFRDYVDRF 
YKTLRAEQAS QEVKNWMTET LLVQNANPDC KTILKALGPA ATLEEMMTAC 

40 QGVGGPGHKA RVLAEAMSQV TNSATIMMQR GNFRNQRKIV KCFNCGKEGH 
TARNCRAPRK KGCWKCGKEG HQMKDCTERQ ANFLGKIWPS YKGRPGNFLQ 
SRPEPTAPPE ESFRSGVETT TPPQKQEPID KELYPLTSLR SLFGNDPSSQ 

[SEQ ID NO: 51] 
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Figure 3 

Sequence of the pl7/24trNEF insert in pl7/24trNEFl 

ATGGGTGCGAGAGCGTCAGTATTAAGCGGGGGAGAATTAGATCGATGGGAAAAAATT 
5 CGGTTAAGGCCAGGGGGAAAGAAAAAATATAAATTAAAACATATAGTATGGGCAAGC 
AGGGAGCTAGAACGATTCGCAGTTAATCCTGGCCTGTTAGAAACATCAGAAGGCTGT 
AGACAAATAGTGGGACAGCTACAACCATCCCTTCAGACAGGATCAGAAGAACTTAGA 
TCATTATATAATACAGTAGCAACCCTCTATTGTGTGCATCAAAGGATAGAGATAAAA 
GACACCAAGGAAGCTTTAGACAAGATAGAGGAAGAGCAAAACAAAAGTAAGAAAAAA 

10 GCACAGCAAGCAGCAGCTGACACAGGACACAGCAATCAGGTCAGCCAAAATTACCCT 
ATAGTGCAGAACATCCAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTA 
AATGCATGGGTAAAAGTAGTAGAAGAGAAGGCTTTCAGCCCAGAAGTGATACCCATG 
TTTTCAGCATTATCAGAAGGAGCCACCCCACAAGATTTAAACACCATGCTAAACACA 
GTGGGGGGACATCAAGCAGCCATGCAAATGTTAAAAGAGACCATCAATGAGGAAGCT 

15 GCAGAATGGGATAGAGTGCATCCAGTGCATGCAGGGCCTATTGCACCAGGCCAGATG 
AGAGAACCAAGGGGAAGTGACATAGCAGGAACTACTAGTACCCTTCAGGAACAAATA 
GGATGGATGACAAATAATCCACCTATCCCAGTAGGAGAAATTTATAAAAGATGGATA 
ATCCTGGGATTAAATAAAATAGTAAGAATGTATAGCCCTACCAGCATTCTGGACATA 
AGACAAGGACCAAAAGAACCCTTTAGAGACTATGTAGACCGGTTCTATAAAACTCTA 

20 AGAGCCGAGCAAGCTTCACAGGAGGTAAAAAATTGGATGACAGAAACCTTGTTGGTC 
CAAAATGCGAACCCAGATTGTAAGACTATTTTAAAAGCATTGGGACCAGCGGCTACA 
CTAGAAGAAATGATGACAGCATGTCAGGGAGTAGGAGGACCCGGCCATAAGGCAAGA 
GTTTTGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAG 
GCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATT 

25 CACTCCCAAAGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTAC 
TTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTT 
GGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAA 
GGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGGATGGATGACCCGGAG 
AGAGAAGTGTTAGAGTGGAGGTTTGACAGCCACCTAGCATTTCATCACGTGGCCCGA 

30 GAGCTGCATCCGGAGTACTTCAAGAACTGCTGA [SEQ ID NO: 52] 

MGARASVLSG GELDRWEKIR LRPGGKKKYK LKHIVWASRE LERFAVNPGL 
LETSEGCRQI LGQLQPSLQT GSEELRSLYN TVATLYCVHQ RIEIKDTKEA 
LDKIEEEQNK SKKKAQQAAA DTGHSNQVSQ NYPIVQNIQG QMVHQAISPR 

35 TLNAWVKWE EKAFSPEVIP MFSALSEGAT PQDLNTMLNT VGGHQAAMQM 
LKETINEEAA EWDRVHPVHA GPIAPGQMRE PRGSDIAGTT STLQEQIGWM 
TNNPPIPVGE lYKRWIILGL NKIVRMYSPT SILDIRQGPK EPFRDYVDRF 
YKTLRAEQAS QEVKNWMTET LLVQNANPDC KTILKALGPA ATLEEMMTAC 
■ QGVGGPGHKA RVLVGFPVTP QVPLRPMTYK AAVDLSHFLK EKGGLEGLIH 

40 SQRRQDILDL WIYHTQGYFP DWQNYTPGPG VRYPLTFGWC YKLVPVEPDK 
VEEANKGENT SLLHPVSLHG MDDPEREVLE WRFDSHLAFH HVARELHPEY 
FKNC* 

[SEQ ID NO: 53] 
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Figure 4 

Sequence of tlie pl7/24opt/trNef insert in 

pl7/24opt/trNef 1 

5 ATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGAGAGCTGGACAGATGGGAGAAAATT 
AGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGTGTGGGCCTCG 
AGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGT 
CGCCAGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGG 
TCCTTGTATAACACAGTGGCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAG 

1 0 GATACCAAGGAGGCCTTGGACAAAATTGAGGAGGAGCAAAACAAGAGCAAGAAGAAG 
GCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCACAGAACTATCCT 
ATTGTCGAAAACATTCAGGGGCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTC 
AATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATG 
TTCTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACC 

1 5 GTGGGCGGCCATCAGGCCGCCATGCAAATGTTGAAGGAGACTATCAACGAGGAGGCA 
GCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCGCCCGGACAGATG 
CGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATC 
GGATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATC 
ATTCTCGGTCTCAATAAAATTGTTAGAATGTACTCTCCGACATCCATCGTTGACATT 

20 AGACAGGGACCCAAAGAGCCTTTTAGGGATTACGTCGACCGGTTTTATAAGACCCTG 
CGAGCAGAGGAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGAGACACTCCTGGTA 
CAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACC 
CTGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGA 
GTGTTGATGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTAC 

25 AAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTA 
ATTCACTCCCAAAGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGC 
TACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACC 
TTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAAT 
AAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGGATGGATGACCCG 

30 GAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCGAGCTAGCATTTCATCACGTGGCC 
CGAGAGCTGCATCCGGAGTACTTCAAGAACTGCTGA [SEQ ID NO: 54] 

MGARASVLSG GELDRWEKIR LRPGGKKKYK LKHIVWASRE LERFAVNPGL 

LETSEGCRQI LGQLQPSLQT GSEELRSLYN TVATLYCVHQ RIEIKDTKEA 

35 LDKIEEEQNK SKKKAQQAAA DTGHSNQVSQ NYPIVQNIQG QMVHQAISPR 

TLNAWVKWE EKAFSPEVIP MFSALSEGAT PQDLNTMLNT VGGHQAAMQM 

LKETINEEAA EWDRVHPVHA GPIAPGQMRE PRGSDIAGTT STLQEQIGWM 

T3SINPPIPVGE lYKRWIILGL NKIVRMYSPT SILDIRQGPK EPFRDYVDRF 

YKTLRAEQAS QEVKNWMTET LLVQNANPDC KTILKALGPA ATLEEMMTAC 

40 QGVGGPGHKA RVLMVGFPVT PQVPLRPMTY KAAVDLSHFL KEKGGLEGLI 

HSQRRQDILD LWIYHTQGYF PDWQNYTPGP GVRYPLTFGW CYKLVPVEPD 

KVEEANKGEN TSLLHPVSLH GMDDPEREVL EWRFDSHLAF HHVARELHPE 
YFKNC* 

[SEQ ID MO: 55] 
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6062^1 

4960.BBp1-U11 



4840p<inL 
483Cxbal 



4566,Bc«.. 

4452Avrll. 
443a,BamHL::: 
4393,PmaCI 




,988 
Nml,1023 



.Sse8387.1243 
ABspMI,1246 
^* Ball,1276 
• .BsrQM309 

,.Spel,1365 



*Ndel.1698 



403S.Kpi , 



Bacilli 8 

Bpu1102,2184 



3296,S«eAr / 
3106,Munl' 



* Ail 1 1.2597 
'Hpa1,27a0 
Mattel 
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Figure 5 

Sequence of RT insert of p7077-RT3: 

5 ATGGGCCCCATCAGTCCCATCGAGACCGTGCCGGTGAAGCTGAAACCCGGGATGGAC 
GGCCCCAAGGTCAAGCAGTGGCCACTCACCGAGGAGAAGATCAAGGCCCTGGTGGAG 
ATCTGCACCGAGATGGAGAAAGAGGGCAAGATCAGCAAGATCGGGCCTGAGAACCCA 
TACAACACCCCCGTGTTTGCCATCAAGAAGAAGGACAGCACCAAGTGGCGCAAGCTG 
GTGGATTTCCGGGAGCTGAATAAGCGGACCCAGGATTTCTGGGAGGTCCAGCTGGGC 

10 ATCCCCCATCCGGCCGGCCTGAAGAAGAAGAAGAGCGTGACCGTGCTGGACGTGGGC 
GACGCTTACTTCAGCGTCCCTCTGGACGAGGACTTTAGAAAGTACACCGCCTTTACC 
ATCCCATCTATCAACAACGAGACCCCTGGCATCAGATATCAGTACAACGTCCTCCCC 
CAGGGCTGGAAGGGCTCTCCCGCCATTTTCCAGAGCTCCATGACCAAGATCCTGGAG 
CCGTTTCGGAAGCAGAACCCCGATATCGTCATCTACCAGTACATGGACGACCTGTAC 

15 GTGGGCTCTGACCTGGAAATCGGGCAGCATCGCACGAAGATTGAGGAGCTGAGGCAG 
CATCTGCTGAGATGGGGCCTGACCACTCCGGACAAGAAGCATCAGAAGGAGCCGCCA 
TTCCTGTGGATGGGCTACGAGCTCCATCCCGACAAGTGGACCGTGCAGCCTATCGTC 
CTCCCCGAGAAGGACAGGTGGACCGTGAACGACATCCAGAAGCTGGTGGGCAAGCTC 
AACTGGGCTAGCCAGATCTATCCCGGGATCAAGGTGCGCCAGCTCTGCAAGCTGCTG 

20 CGCGGCACCAAGGCCCTGACCGAGGTGATTCCCCTCACGGAGGAAGCCGAGCTCGAG 
CTGGCTGAGAACCGGGAGATCCTGAAGGAGCCCGTGCACGGCGTGTACTATGACCCC 
TCCAAGGACCTGATCGCCGAAATCCAGAAGCAGGGCCAGGGGCAGTGGACATACCAG 
ATTTACCAGGAGCCTTTCAAGAACCTCAAGACCGGCAAGTACGCCCGCATGAGGGGC 
GCCCACACCAACGATGTCAAGCAGCTGACCGAGGCCGTCCAGAAGATCACGACCGAG 

25 TCCATCGTGATCTGGGGGAAGACACCCAAGTTCAAGCTGCCTATCCAGAAGGAGACC 
TGGGAGACGTGGTGGACCGAATATTGGCAGGCCACCTGGATTCCCGAGTGGGAGTTC 
GTGAATACACCTCCTCTGGTGAAGCTGTGGTACCAGCTCGAGAAGGAGCCCATCGTG 
GGCGCGGAGACATTCTACGTGGACGGCGCGGCCAACCGCGAAACAAAGCTCGGGAAG 
GCCGGGTACGTCACCAACCGGGGCCGCCAGAAGGTCGTCACCCTGACCGACACCACC 

30 AACCAGAAGACGGAGCTGCAGGCCATCTATCTCGCTCTCCAGGACTCCGGCCTGGAG 
GTGAACATCGTGACGGACAGCCAGTACGCGCTGGGCATTATTCAGGCCCAGCCGGAC 
CAGTCCGAGAGCGAACTGGTGAACCAGATTATCGAGCAGCTGATCAAGAAAGAGAAG 
GTCTACCTCGCCTGGGTCCCGGCCCATAAGGGCATTGGCGGCAACGAGCAGGTCGAC 
AAGCTGGTGAGTGCGGGGATTAGAAAGGTGCTGTAA [SEQ ID NO : 56] 

35 

MGPISPIETV SVKLKPGMDG PKVKQWPLTE EKIKALVEIC TEMEKEGKIS 
KIGPENPYNT PVFAIKKKDS TKWRKLVDFR ELNKRTQDFW EVQLGIPHPA 
GliKKKKSVTV LDVGDAYFSV PLDEDFRKYT AFTIPSINNE TPGIRYQYNV 
LPQGWKGSPA IFQSSMTKIL EPFRKQNPDI VIYQYMDDLY VGSDLEIGQH 

40 rtkieelrqh llrwglttpd kkhqkeppfl wmgyelhpdk wtvqpivlpe 
kdswtvndiq klvgklnwas qiypgikvrq lckllrgtka ltevipltee 
AELELjAENRE ilkepvhgvy ydpskdliae iqkqgqgqwt yqiyqepfkn 

LKTGKYARMR GAHTNDVKQL TEAVQKITTE SIVIWGKTPK FKLPIQKETW 
ETWWTEYWQA TWIPEWEFVN TPPLVKLWYQ LEKEPIVGAE TFYVDGAANR 
45 ETKLGKAGYV TNRGRQKWT LTDTTNQKTE LQAIYLALQD SGLEVNIVTD 
SQYALGIIQA QPDQSESELV NQIIEQLIKK EKVYIiAWVPA HKGIGGNEQV 
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.Stul,56 



5233,Drdl 

5131,BspLU11.. 

S011,Xmnl. 
5007^31.;.. 



.Safl.680 
/.Pvul,681 



4623AvrlL 
4609.BamHI.::s 



4202.Kpnl/ 
4202,Asp718 



4053,Tthr 
4000.Nari;.'i 
4000.Kasl;V 
4000,Ehel/ 
4000,Bbel 




.Clal.988 
•'..Nrul.1023 



.Sse8387,1243 

,.BsfGI,130g 

...Spel.i365 



"Ndet.isge 



'Sacll,2018 

Bpul 102^1 84 



3724,Nher 



3217.r«igoM! 
3217.Nael 
3216,Fser 



Aflll,2697 
Hpal,2780 
Scal,2B10 
N0tl,2911 
Apal,2922 
Bsp120l,2922 



5 



10 



15 



20 
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Figure 6 

Sequence of the coding insert in p73i-RT3: 

5 ATGGGCCCCATCAGTCCCATCGAGACCGTGCCGGTGAAGCTGAAACCCGGGATGGAC 
GGCCCCAAGGTCAAGCAGTGGCCACTCACCGAGGAGAAGATCAAGGCCCTGGTGGAG 
ATCTGCACCGAGATGGAGAAAGAGGGCAAGATCAGCAAGATCGGGCCTGAGAACCCA 
TACAACACCCCCGTGTTTGCCATCAAGAAGAAGGACAGCACCAAGTGGCGCAAGCTG 
GTGGATTTCCGGGAGCTGAATAAGCGGACCCAGGATTTCTGGGAGGTCCAGCTGGGG 

10 ATCCCCCATCCGGCCGGCCTGAAGAAGAAGAAGAGCGTGACCGTGCTGGACGTGGGC 
GACGCTTACTTCAGCGTCCCTCTGGACGAGGACTTTAGAAAGTACACCGCCTTTACC 
ATCCCATCTATCAACAACGAGACCCCTGGCATCAGATATCAGTACAACGTCCTCCCC 
CAGGGCTGGAAGGGCTCTCCCGCCATTTTCCAGAGCTCCATGACCAAGATCCTGGAG 
CCGTTTCGGAAGCAGAACCCCGATATCGTCATCTACCAGTACATGGACGACCTGTAC 

1 5 GTGGGCTCTGACCTGGAAATCGGGCAGCATCGCACGAAGATTGAGGAGCTGAGGCAG 
CATCTGCTGAGATGGGGCCTGACCACTCCGGACAAGAAGCATCAGAAGGAGCCGCCA 
TTCCTGTGGATGGGCTACGAGCTCCATCCCGACAAGTGGACCGTGCAGCCTATCGTC 
CTCCCCGAGAAGGACAGCTGGACCGTGAACGACATCCAGAAGCTGGTGGGCAAGCTC 
AACTGGGCTAGCCAGATCTATCCCGGGATCAAGGTGCGCCAGCTCTGCAAGCTGCTG 

20 CGCGGCACCAAGGCCCTGACCGAGGTGATTCCCCTCACGGAGGAAGCCGAGCTCGAG 
CTGGCTGAGAACCGGGAGATCCTGAAGGAGCCCGTGCACGGCGTGTACTATGACCCC 
TCCAAGGACCTGATCGCCGAAATCCAGAAGCAGGGCCAGGGGCAGTGGACATACCAG 
ATTTACCAGGAGCCTTTCAAGAACCTCAAGACCGGCAAGTACGCCCGCATGAGGGGC 
GCCCACACCAACGATGTCAAGCAGCTGACCGAGGCCGTCCAGAAGATCACGACCGAG 

25 TCCATCGTGATCTGGGGGAAGACACCCAAGTTCAAGCTGCCTATCCAGAAGGAGACC 
TGGGAGACGTGGTGGACCGAATATTGGCAGGCCACCTGGATTCCCGAGTGGGAGTTC 
GTGAATACACCTCCTCTGGTGAAGCTGTGGTACCAGCTCGAGAAGGAGCCCATCGTG 
GGCGCGGAGACATTCTACGTGGACGGCGCGGCCAACCGCGAAACAAAGCTCGGGAA 
GGCCGGGTACGTCACCAACCGGGGCCGCCAGAAGGTCGTCACCCTGACCGACACCAC 

30 CAACCAGAAGACGGAGCTGCAGGCCATCTATCTCGCTCTCCAGGACTCCGGCCTGGA 
GGTGAACATCGTGACGGACAGCCAGTACGCGCTGGGCATTATTCAGGCCCAGCCGGA 
CCAGTCCGAGAGCGAACTGGTGAACCAGATTATCGAGCAGCTGATCAAGAAAGAGAA 
GGTCTACCTCGCCTGGGTCCCGGCCCATAAGGGCATTGGCGGCAACGAGCAGGTCGA 

CAAGCTGGTGAGTGCGGGGATTAGAAAGGTGCTGTAA [SEQ ID NO: 58] 

35 

MGPISPIETV SVKLKPGMDG PKVKQWPLTE EKIKALVEIC TEMEKEGKIS 
KIGPENPYNT PVFAIKKKDS TKWRKLVDFR ELNKRTQDFW EVQLGIPHPA 
GLKKKKSVTV LDVGDAYFSV PLDEDFRKYT AFTIPSIMNE TPGIRYQYNV 
LPQGWKGSPA IFQSSMTKIL EPFRKQNPDI VIYQYMDDLY VGSDLEIGQH 

40 RTKIEELRQH LLRWGLTTPD KKHQKEPPFL WMGYELHPDK WTVQPIVLPE 
KDSWTVNDIQ KLVGKLNWAS QIYPGIKVRQ LCKliLRGTKA LTEVIPLTEE 
AELELAENRE ILKEPVHGVY YDPSKDLIAE IQKQGQGQWT YQIYQEPFKN 
LKTGKYARMR GAHTNDVKQL TEAVQKITTE SIVIWGKTPK FKLiPIQKETW 
ETWWTEYWQA TWIPEWEFVN TPPLVKLWYQ LEKEPIVGAE TFYVDGAANR 

45 ETKLGKAGYV TNRGRQKWT LTDTTNQKTE LQAIYLALQD SGLEVNIVTD 
SQYALGIIQA QPDQSESELV NQIIEQLIKK EKVYLAWVPA HKGIGGNEQV 
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DKLVSAGIRK VL* 
[SEQ ID NO: 59] 



.Stul,56 



4270,Drdl. 

4168.BspLU11., 
40dg.BsrBI.. 

39a8,EcoRI.,. 
3916,Mlul. 



36S9,Kpai..- 
3e89.Xmnl 



3556,BamHI 
3514.Sall'/ 
3510,BapMI /; 
3476.SanDI / 
3446,Bcll 



.HIndlll,S60 




..Clal,988 
,.Nnil,1Q23 



.Sphl,1238 
,-;.'.Sse8387.1243 



'Ndel.isgo 

'SnaBl.1495 
Ncol,1517 



3149.Kp 
3149,Asp718 



1609 



Bsp120l,ia69 



2947,Narl* 
2947,Kast; 
2947,Eheli 
2947,Bbel 



2e71,Nhsl 



BspEI.2520 



\FseI,2183 
;N3el,2164 
NgoMI^164 
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Sequence of Nef insert in 7077trNei20 

ATGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAGGCA 
5 GCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCAC 
TCCCAAAGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTC 
CCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGA 
TGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGA 
GAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGGATGGATGACCCGGAGAGA 
10 GAAGTGTTAGAGTGGAGGTTTGACAGCCACCTAGCATTTCATCACGTGGCCCGAGAG 
CTGCATCCGGAGTACTTCAAGAACTGCTGA [SEQ ID NO : 60] 

MVGFPVTPQV PLRPMTYKAA VDLSHFLKEK GGLEGLIHSQ RRQDILDLWI 
YHTQGYFPDW QNYTPGPGVR YPLTFGWCYK LVPVEPDKVE EANKGENTSL 
15 LHPVSLHGMD DPEREVLEWR FDSVIiAFHHV ARELHPEYFK NC* 
[SEQ ID NO: 61] 



20 



25 



30 

Figure 8 
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Sequence of RT insert in 7077RT8 



ATGGGCCCCATTAGCCCTATTGAGACTGTGTCAGTAAAATTAAAGCCAGGAATGGAT 
GGCCCAAAAGTTAAACAATGGCCATTGACAGAAGAAAAAATAAAAGCATTAGTAGAA 
ATTTGTACAGAGATGGAAAAGGAAGGGAAAATTTCAAAAATTGGGCCTGAAAATCCA 
5 TACAATACTCCAGTATTTGCCATAAAGAAAAAAGACAGTACTAAATGGAGAAAATTA 
GTAGATTTCAGAGAACTTAATAAGAGAACTCAAGAGTTCTGGGAAGTTCAATTAGGA 
ATACCACATCCCGCAGGGTTAAAAAAGAAAAAATCAGTAACAGTACTGGATGTGGGT 
GATGCATATTTTTCAGTTCCCTTAGATGAAGACTTCAGGAAATATACTGCATTTACC 
ATAGCTAGTATAAACAATGAGACACCAGGGATTAGATATCAGTACAATGTGCTTCCA 

1 0 CAGGGATGGAAAGGATCACCAGCAATATTCCAAAGTAGCATGACAAAAATCTTAGAG 
CCTTTTAGAAAACAAAATCCAGACATAGTTATCTATCAATACATGGATGATTTGTAT 
GTAGGATCTGACTTAGAAATAGGGCAGCATAGAACAAAAATAGAGGAGCTGAGACAA 
CATCTGTTGAGGTGGGGACTTACCACACCAGACAAAAAACATCAGAAAGAACCTCCA 
TTCCTTTGGATGGGTTATGAACTCCATCCTGATAAATGGACAGTACAGCCTATAGTG 

1 5 CTGCCAGAAAAAGACAGCTGGACTGTCAATGACATACAGAAGTTAGTGGGGAAATTG 
AATTGGGCAAGTCAGATTTACCCAGGGATTAAAGTAAGGCAATTATGTAAACTCCTT 
AGAGGAACCAAAGCACTAACAGAAGTAATACCACTAACAGAAGAAGCAGAGCTAGAA 
CTGGCAGAAAACAGAGAGATTCTAAAAGAACCAGTACATGGAGTGTATTATGACCCA 
TCAAAAGACTTAATAGCAGAAATACAGAAGCAGGGGCAAGGCCAATGGACATATCAA 

20 ATTTATCAAGAGCCATTTAAAAATCTGAAAACAGGAAAATATGCAAGAATGAGGGGT 
GCCCACACTAATGATGTAAAACAATTAACAGAGGCAGTGCAAAAAATAACCACAGAA 
AGCATAGTAATATGGGGAAAGACTCCTAAATTTAAACTGCCCATACAAAAGGAAACA 
TGGGAAACATGGTGGACAGAGTATTGGCAAGCCACCTGGATTCCTGAGTGGGAGTTT 
GTTAATACCCCTCCCTTAGTGAAATTATGGTACCAGTTAGAGAAAGAACCCATAGTA 

25 GGAGCAGAAACCTTCTATGTAGATGGGGCAGCTAACAGGGAGACTAAATTAGGAAAA 
GCAGGATATGTTACTAATAGAGGAAGACAAAAAGTTGTCACCCTAACTGACACAACA 
AATCAGAAGACTGAGTTACAAGCAATTTATCTAGCTTTGCAGGATTCGGGATTAGAA 
GTAAACATAGTAACAGACTCACAATATGCATTAGGAATCATTCAAGCACAACCAGAT 
CAAAGTGAATCAGAGTTAGTCAATCAAATAATAGAGCAGTTAATAAAAAAGGAAAAG 

3 0 GTCTATCTGGCATGGGTACCAGCACACAAAGGAATTGGAGGAAATGAACAAGTAGAT 
AAATTAGTCAGTGCTGGAATCAGGAAAGTACTATTTTTAGATTAA 
[SEQ ID NO: 62] 



MGPISPIETV SVKLKPGMDG PKVKQWPLTE EKIKALVEIC TEMEKEGKIS 
35 KIGPENPYNT PVFAIKKKDS TKWRKLVDFR ELNKRTQDFW EVQLGIPHPA 
GLKKKKSVTV LDVGDAYFSV PLDEDFRKYT AFTIPSINNE TPGIRYQYNV 
LPQGWKGSPA IFQSSMTKIL EPFRKQNPDI VIYQYMDDLY VGSDLEIGQH 
RTKIEELRQH LLRWGLTTPD KKHQKEPPFL WMGYELHPDK WTVQPIVLPE 
KDSWTVNDIQ KLVGKLNWAS QIYPGIKVRQ LCKLLRGTKA LTEVIPLTEE 
40 AELELAENRE ILKEPVHGVY YDPSKDLIAE IQKQGQGQWT YQIYQEPFKN 
LKTGKYARMR GAHTNDVKQL TEAVQKITTE SIVIWGKTPK FKLPIQKETW 
ETWWTEYWQA TWIPEWEFVN TPPLVKLWYQ LEKEPIVGAE TFYVDGAANR 
ETKLGKAGYV TNRGRQKWT LTDTTNQKTE LQAIYLALQD SGLEVNIVTD 
SQYALGIIQA QPDQSESELV NQIIEQLIKK EKVYLAWVPA HKGIGGNEQV 
45 DKLVSAGIRK VLFLD* 
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[SEQ ID NO: 53] 



5 



10 



15 



20 



25 
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Figure 9 

Sequence of the pi 7/24opt/RT/trNef insert in pl7/24opt/RT/trNen3 

ATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGAGAGCTGGACAGATGGGAGAAAATT 

AGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGTGTGGGCCTCG 

AGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGT 

CGCCAGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGG 

TCCTTGTATAACACAGTGGCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAG 

GATACCAAGGAGGCCTTGGACAAAATTGAGGAGGAGCAAAACAAGAGCAAGAAGAAG 

GCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCACAGAACTATCCT 

ATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTC 

AATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATG 

TTCTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACC 

GTGGGCGGCCATCAGGCCGCCATGCAAATGTTGAAGGAGACTATCAACGAGGAGGCA 

GCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCGCCCGGACAGATG 

CGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATC 

GGATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATC 

ATTCTCGGTCTCAATAAAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATT 

AGACAGGGACCCAAAGAGCCTTTTAGGGATTACGTCGACCGGTTTTATAAGACCCTG 

CGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGAGACACTCCTGGTA 

CAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACC 

CTGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGA 

GTGTTGATGGGCCCCATTAGCCCTATTGAGACTGTGTCAGTAAAATTAAAGCCAGGA 

ATGGATGGCCCAAAAGTTAAACAATGGCCATTGACAGAAGAAAAAATAAAAGCATTA 

GTAGAAATTTGTACAGAGATGGAAAAGGAAGGGAAAATTTCAAAAATTGGGCCTGA?^ 

AATCCATACAATACTCCAGTATTTGCCATAAAGAAAAAAGACAGTACTAAATGGAGA 

AAATTAGTAGATTTCAGAGAACTTAATAAGAGAACTCAAGACTTCTGGGAAGTTCAA 

TTAGGAATACCACATCCCGCAGGGTTAAAAAAGAAAAAATCAGTAACAGTACTGGAT 

GTGGGTGATGCATATTTTTCAGTTCCCTTAGATGAAGACTTCAGGAAATATACTGCA 

TTTACCATACCTAGTATAAACAATGAGACACCAGGGATTAGATATCAGTACAATGTG 

CTTCCACAGGGATGGAAAGGATCACCAGCAATATTCCAAAGTAGCATGACAAAAATC 

TTAGAGCCTTTTAGAAAACAAAATCCAGACATAGTTATCTATCAATACATGGATGAT 

TTGTATGTAGGATCTGACTTAGAAATAGGGCAGCATAGAACAAAAATAGAGGAGCTG 

AGACAACATCTGTTGAGGTGGGGACTTACCACACCAGACAA?iAAACATCAGAAAGAA 

CCTCCATTCCTTTGGATGG6TTATGAACTCCATCCTGATAAATGGACAGTACAGCCT 

ATAGTGCTGCCAGAAAAAGACAGCTGGACTGTCAATGACATACAGAAGTTAGTGGGG 

AAATTGAATTGGGCAAGTCAGATTTACCCAGGGATTAAAGTAAGGCAATTATGTAAA 

CTCCTTAGAGGAACCAAAGCACTAACAGAAGTAATACCACTAACAGAAGAAGCAGAG 

CTAGAACTGGCAGAAAACAGAGAGATTCTAAAAGAACCAGTACATGGAGTGTATTAT 

GACCCATCAAAAGACTTAATAGCAGAAATACAGAAGCAGGGGCAAGGCCAATGGACA 

TATCAAATTTATCAAGAGCCATTTAAAAATCTGAAAACAGGAAAATATGCAAGAATG 

AGGGGTGCCCACACTAATGATGTAAAACAATTAACAGAGGCAGTGCAAAAAATAACC 

ACAGAAAGCATAGTAATATGGGGAAAGACTCCTAAATTTAAACTGCCCATACAAAAG 

GAAACATGGGAAACATGGTGGACAGAGTATTGGCAAGCCACCTGGATTCCTGAGTGG 
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GAGTTTGTTAATACCCCTCCCTTAGTGAAATTATGGTACCAGTTAGAGAAAGAACCC 
ATAGTAGGAGCAGAAACCTTCTATGTAGATGGGGCAGCTAACAGGGAGACTAAATTA 
GGAAAAGCAGGATATGTTACTAATAGAGGAAGACAAAAAGTTGTCACCCTAACTGAC 
ACAACAAATCAGAAGACTGAGTTACAAGCAATTTATCTAGCTTTGCAGGATTCGGGA 
5 TTAGAAGTAAACATAGTAACAGACTCACAATATGCATTAGGAATCATTCAAGCACAA 
CCAGATCAAAGTGAATCAGAGTTAGTCAATCAAATAATAGAGCAGTTAATAAAAAAG 
GAAAAGGTCTATCTGGCATGGGTACCAGCACACAAAGGAATTGGAGGAAATGAACAA 
GTAGATAAATTAGTCAGTGCTGGAATCAGGAAAGTACTATTTTTAGATATGGTGGGT 
TTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGAT 

1 0 CTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAAAGA 
AGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTGATTGG 
CAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTAC 
AAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACC 
AGCTTGTTACACCCTGTGAGCCTGCATGGGATGGATGACCCGGAGAGAGAAGTGTTA 

15 GAGTGGAGGTTTGACAGCCACCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCG 
GAGTACTTCAAGAACTGCTGA [SEQ ID NO: 64] 



MGARASVLSG GELDRWBKIR LRPGGKKKYK LiKHIVWASRE IiERFAVNPGL 
liETSEGCRQI LGQLQPSLQT GSEELRSLYN TVATLYCVHQ RIEIKDTKEA 

20 LDKIEEEQNK SKKKAQQAAA DTGHSNQVSQ NYPIVQNIQG QMVHQAISPR 
TLiNAWVKWE EKAFSPEVIP MFSALSEGAT PQDLNTMLNT VGGHQAAMQM 
LKETINEEAA EWDRVHPVHA GPIAPGQMRE PRGSDIAGTT STLQEQIGWM 
TNNPPIPVGE lYKRWIILGL NKIVRMYSPT SILDIRQGPK EPFRDYVDRF 
YKTLRAEQAS QEVKNWMTET LLVQNANPDC KTILKALGPA ATLEEMMTAC 

25 QGVGGPGHKA RVLMGPISPI ETVSVKLKPG MDGPKVKQWP LTEEKIKALV 
EICTEMEKEG KISKIGPENP YNTPVFAIKK KDSTKWRKLV DFRELNKRTQ 
DFWEVQLGIP HPAGLKKKKS VTVLDVGDAY FSVPLDEDFR KYTAFTIPSI 
NNETPGIRYQ YNVLPQGWKG SPAIFQSSMT KILEPFRKQN PDIVIYQYMD 
DLYVGSDLEI GQHRTKIEEL RQHLLRWGLT TPDKKHQKEP PFLWMGYELH 

30 PDKWTVQPIV LPEKDSWTVN DIQKLVGKLN WASQIYPGIK VRQLCKLLRG 
TKALTEVIPL TEEAELELAE NREILKEPVH GVYYDPSKDL lAEIQKQGQG 
QWTYQIYQEP FKNLKTGKYA RMRGAHTNDV KQLTEAVQKI TTESIVIWGK 
TPKFKLPIQK ETWETWWTEY WQATWIPEWE FVNTPPLVKL WYQLEKEPIV 
GAETFYVDGA ANRETKLGKA GYVTNRGRQK WTLTDTTNQ KTELQAIYIoA 

35 LQDSGLEVNI VTDSQYALGI IQAQPDQSES ELVNQIIEQL IKKEKVYLAW 
VPAHKGIGGN EQVDKLVSAG IRKVLFLDMV GFPVTPQVPL RPMTYKAAVD 
LSHFLKEKGG LEGLIHSQRR QDILDLWIYH TQGYFPDWQN YTPGPGVRYP 
LTFGWCYKLV PVEPDKVEEA NKGENTSLLH PVSLHGMDDP EREVLEWRFD 
SHLAFHHVAR ELHPEYFKNC * 

40 [SEQ ID NO: 65] 
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Figure 10 

Sequence of the pl7/p24opt(cor)/RT/trNef coding insert in WRG7077: 

ATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGAGAGCTGGACAGATGGGAGAAAATT 
5 AGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGTGTGGGCCTCG 
AGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGT 
CGCCAGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGG 
TCCTTGTATAACACAGTGGCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAG 
GATACCAAGGAGGCCTTGGACAAAATTGAGGAGGAGCAAAACAAGAGCAAGAAGAAG 

10 GCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCACAGAACTATCCT 
ATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTC 
AATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATG 
TTCTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACC 
GTGGGCGGCCATCAGGCCGCCATGCAAATGTTGAAGGAGACTATCAACGAGGAGGCA 

15 GCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCGCCCGGACAGATG 
CGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATC 
GGATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATC 
ATCCTGGGCCTGAACAAGATCGTGCGCATGTACTCTCCGACATCCATCCTTGACATT 
AGACAGGGACCCAAAGAGCCTTTTAGGGATTACGTCGACCGGTTTTATAAGACCCTG 

20 CGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGAGACAGTCCTGGTA 
CAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACC 
CTGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGA 
GTGTTGATGGGCCCCATTAGCCCTATTGAGACTGTGTCAGTAAAATTAAAGCCAGGA 
ATGGATGGCCCAAAAGTTAAACAATGGCCATTGACAGAAGAAAAAATAAAAGCATTA 

25 GTAGAAATTTGTACAGAGATGGAAAAGGAAGGGAAAATTTCAAAAATTGGGCCTGAA 
AATCCATACAATACTCCAGTATTTGCCATAAAGAAAAAAGACAGTACTAAATGGAGA 
AAATTAGTAGATTTCAGAGAACTTAATAAGAGAACTCAAGACTTCTGGGAAGTTCAA 
TTAGGAATACCACATCCCGCAGGGTTAAAAAAGAAAAAATCAGTAACAGTACTGGAT 
GTGGGTGATGCATATTTTTCAGTTCCCTTAGATGAAGACTTCAGGAAATATACTGCA 

30 TTTACCATACCTAGTATAAACAATGAGACACCAGGGATTAGATATCAGTACAATGTG 
CTTCCACAGGGATGGAAAGGATCACCAGCAATATTCCAAAGTAGCATGACAAAAATC 
TTAGAGCCTTTTAGAAAACAAAATCCAGACATAGTTATCTATCAATACATGGATGAT 
TTGTATGTAGGATCTGACTTAGAAATAGGGCAGCATAGAACAAAAATAGAGGAGCTG 
AGACAACATCTGTTGAGGTGGGGACTTACCACACCAGACAAAAAACATCAGAAAGAA 

35 CCTCCATTCCTTTGGATGGGTTATGAACTCCATCCTGATAAATGGACAGTACAGCCT 
ATAGTGCTGCCAGAAAAAGACAGCTGGACTGTCAATGACATACAGAAGTTAGTGGGG 
AAATTGAATTGGGCAAGTCAGATTTACCCAGGGATTAAAGTAAGGCAATTATGTAAA 
CTCCTTAGAGGAACCAAAGCACTAACAGAAGTAATACCACTAACAGAAGAAGCAGAG 
CTAGAACTGGCAGAAAACAGAGAGATTCTAAAAGAACCAGTACATGGAGTGTATTAT 

40 GACCCATCAAAAGACTTAATAGCAGAAATACAGAAGCAGGGGCAAGGCCAATGGACA 
TATCAAATTTATCAAGAGCCATTTAAAAATCTGAAAACAGGAAAATATGCAAGAATG 
AGGGGTGCCCACACTAATGATGTAAAACAATTAACAGAGGCAGTGCAAAAAATAACC 
ACAGAAAGCATAGTAATATGGGGAAAGACTCCTAAATTTAAACTGCCCATACAAAAG 
GAAACATGGGAAACATGGTGGACAGAGTATTGGCAAGCCACCTGGATTCCTGAGTGG 

45 GAGTTTGTTAATACCCCTCCCTTAGTGAAATTATGGTACCAGTTAGAGAAAGAACCC 
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ATAGTAGGAGCAGAAACCTTCTATGTAGATGGGGCAGCTAACAGGGAGACTAAATTA 
GGAAAAGCAGGATATGTTACTAATAGAGGAAGACAAAAAGTTGTCACCCTAACTGAC 
ACAACAAATCAGAAGACTGAGTTACAAGCAATTTATCTAGCTTTGCAGGATTCGGGA 
TTAGAAGTAAACATAGTAACAGACTCACAATATGCATTAGGAATCATTCAAGCACAA 
5 CCAGATCAAAGTGAATCAGAGTTAGTCAATCAAATAATAGAGCAGTTAATAAAAAAG 
GAAAAGGTCTATCTGGCATGGGTACCAGCACACAAAGGAATTGGAGGAAATGAACAA 
GTAGATAAATTAGTCAGTGCTGGAATCAGGAAAGTACTATTTTTAGATATGGTGGGT 
TTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGAT 
CTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAAAGA 

10 AGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTGATTGG 
CAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTAC 
AAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACC 
AGCTTGTTACACCCTGTGAGCCTGCATGGGATGGATGACCCGGAGAGAGAAGTGTTA 
GAGTGGAGGTTTGACAGCCACCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCG 

15 GAGTACTTCAAGAACTGCTGA [SEQ ID NO: 66] 



MGARASVLSG GELDRWEKIR LRPGGKKKYK LKHIVWASRE LERFAVNPGL 
LETSEGCRQI LGQLQPSLQT GSEELRSLYN TVATLYCVHQ RIEIKDTKEA 

20 LDKIEEEQNK SKKKAQQAAA DTGHSNQVSQ NYPIVQNIQG QMVHQAISPR 
TLNAWVKWE EKAFSPEVIP MFSALSEGAT PQDLNTMLNT VGGHQAAMQM 
LKETINEEAA EWDRVHPVHA GPIAPGQMRE PRGSDIAGTT STLQEQIGWM 
TNNPPIPVGE lYKRWIIIiGL NKIVRMYSPT SILDIRQGPK EPFRDYVDRF 
YKTLRAEQAS QEVKNWMTET LLVQNANPDC KTILKALGPA ATLEEMMTAC 

25 QGVGGPGHKA RVLMGPISPI ETVSVKLKPG MDGPKVKQWP LTEEKIKALV 
EICTEMEKEG KISKIGPENP YNTPVFAIKK KDSTKWRKLV DFRELNKRTQ 
DFWEVQLGIP HPAGLKKKKS VTVLDVGDAY FSVPLDEDFR KYTAFTIPSI 
NNETPGIRYQ YNVLPQGWKG SPAIFQSSMT KILEPFRKQN PDIVIYQYMD 
DLYVGSDLEI GQHRTKIEEL RQHLLRWGLT TPDKKHQKEP PFLWMGYELH 

30 PDKWTVQPIV LPEKDSWTVN DIQKLVGKLN WASQIYPGIK VRQLCKLLRG 
TKALTEVIPL TB-EAELELAE NREILKEPVH GVYYDPSKDL lAEIQKQGQG 
QWTYQIYQEP FKNLKTGKYA RMRGAHTNDV KQLTEAVQKI TTESIVIWGK 
TPKFKLPIQK ETWETWWTEY WQATWIPEWE FVNTPPLVKL WYQLEKEPIV 
GAETFYVDGA ANRETKLGKA GYVTNRGRQK WTLTDTTNQ KTELQAIYLA 

35 LQDSGLEVNI VTDSQYALGI IQAQPDQSES ELVNQIIEQL IKKEKVTIjAW 
VPAHKGIGGN EQVDKLVSAG IRKVLFLDMV GFPVTPQVPL RPMTYKAAVD 
LiSHFLKEKGG LEGLIHSQRR QDILDLWIYH TQGYFPDWQN YTPGPGVRYP 
LTFGWCYKLV PVEPDKVEEA NKGENTSLLH PVSLHGMDDP EREVLEWRFD 
SHLAFHHVAR EliHPEYFKNC * 

40 [SEQ ID NO: 67] 
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Figure 11 

Sequence of the pl7/p24(opt)/RT(opt)trNef insert in p73i-GRN2: 

ATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGAGAGCTGGACAGATGGGA 
5 GAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCAT 
ATCGTGTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCT 
GCTGGAAACATCTGAGGGATGTCGCCAGATCCTGGGGCAATTGCAGCCAT 
CCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTGGCT 
ACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGC 

1 0 CTTGGAC AAAATTGAGGAGGAGCAAAACAAGAGCAAGAAGAAGGCCCAG 
CAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCACAGAACTATCC 
TATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCC 
GGACGCTCAATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCT 
GAGGTTATCCCCATGTTCTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGAC 

1 5 CTCAATAC AATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATGCAAAT 
GTTGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCAT 
CCCGTCCACGCTGGCCCAATCGCGCCCGGACAGATGCGGGAGCCTCGCGG 
CTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGGATGGA 
TGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATC 

20 ATCCTGGGCCTGAACAAGATCGTGCGCATGTACTCTCCGACATCCATCCTT 
GACATTAGACAGGGACCCAAAGAGCCTTTTAGGGATTACGTCGACCGGTT 
TTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGA 
TGACGGAGACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATC 
TTGAAGGCACTAGGCCCGGCTGCCACCCTGGAAGAGATGATGACCGCCTG 

25 TCAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGATGGGCCCC 
ATCAGTCCCATCGAGACCGTGCCGGTGAAGCTGAAACCCGGGATGGACGG 
CCCCAAGGTCAAGCAGTGGCCACTCACCGAGGAGAAGATCAAGGCCCTG 
GTGGAGATCTGCACCGAGATGGAGAAAGAGGGCAAGATCAGCAAGATCG 
GGCCTGAGAACCCATACAACACCCCCGTGTTTGCCATCAAGAAGAAGGAC 

30 AGCACCAAGTGGCGCAAGCTGGTGGATTTCCGGGAGCTGAATAAGCGGAC 
CCAGGATTTCTGGGAGGTCCAGCTGGGCATCCCCCATCCGGCCGGCCTGA 
AGAAGAAGAAGAGCGTGACCGTGCTGGACGTGGGCGACGCTTACTTCAGC 
GTCCCTCTGGACGAGGACTTTAGAAAGTACACCGCCTTTACCATCCCATCT 
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ATCAACAACGAGACCCCTGGCATCAGATATCAGTACAACGTCCTCCCCCA 
GGGCTGGAAGGGCTCTCCCGCCATTTTCCAGAGCTCCATGACCAAGATCC 
TGGAGCCGTTTCGGAAGCAGAACCCCGATATCGTCATCTACCAGTACATG 
GACGACCTGTACGTGGGCTCTGACCTGGAAATCGGGCAGCATCGCACGAA 
5 GATTGAGGAGCTGAGGCAGCATCTGCTGAGATGGGGCCTGACCACTCCGG 
ACAAGAAGCATCAGAAGGAGCCGCCATTCCTGTGGATGGGCTACGAGCTC 
CATCCCGACAAGTGGACCGTGCAGCCTATCGTCCTCCCCGAGAAGGACAG 
CTGGACCGTGAACGACATCCAGAAGCTGGTGGGCAAGCTCAACTGGGCTA 
GCCAGATCTATCCCGGGATCAAGGTGCGCCAGCTCTGCAAGCTGCTGCGC 

1 0 GGCACCAAGGCCCTGACCGAGGTGATTCCCCTCACGGAGGAAGCCGAGCT 
CGAGCTGGCTGAGAACCGGGAGATCCTGAAGGAGCCCGTGCACGGCGTGT 
ACTATGACCCCTCCAAGGACCTGATCGCCGAAATCCAGAAGCAGGGCCAG 
GGGCAGTGGACATACCAGATTTACCAGGAGCCTTTCAAGAACCTCAAGAC 
CGGCAAGTACGCCCGCATGAGGGGCGCCCACACCAACGATGTCAAGCAG 

1 5 CTGACCGAGGCCGTCCAGAAGATCACGACCGAGTCCATCGTGATCTGGGG 
GAAGACACCCAAGTTCAAGCTGCCTATCCAGAAGGAGACCTGGGAGACGT 
GGTGGACCGAATATTGGCAGGCCACCTGGATTCCCGAGTGGGAGTTCGTG 
AATACACCTCCTCTGGTGAAGCTGTGGTACCAGCTCGAGAAGGAGCCCAT 
CGTGGGCGCGGAGACATTCTACGTGGACGGCGCGGCCAACCGCGAAACA 

20 AAGCTCGGGAAGGCCGGGTACGTCACCAACCGGGGCCGCCAGAAGGTCG 
TCACCCTGACCGACACCACCAACCAGAAGACGGAGCTGCAGGCCATCTAT 
CTCGCTCTCCAGGACTCCGGCCTGGAGGTGAACATCGTGACGGACAGCCA 
GTACGCGCTGGGCATTATTCAGGCCCAGCCGGACCAGTCCGAGAGCGAAC 
TGGTGAACCAGATTATCGAGCAGCTGATCAAGAAAGAGAAGGTCTACCTC 

25 GCCTGGGTCCCGGCCCATAAGGGCATTGGCGGCAACGAGCAGGTCGACAA 
GCTGGTGAGTGCGGGGATTAGAAAGGTGCTGATGGTGGGTTTTCCAGTCA 
CACCTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTT 
AGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCA 
AAGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACT 

30 TCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTG 
ACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGA 
AGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGC 
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ATGGGATGGATGACCCGGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAG 
CCGCCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCA 

AGAACTGCTGA [SEQ ID NO: 68] 



5 MGARASVLSG GELDRWEKIR LRPGGKKKYK LKHIVWASRE LERFAVNPGL 

LETSEGCRQI LGQLQPSLQT GSEELRSLYN TVATLYCVHQ RIEIKDTKEA 

LDKIEEEQNK SKKKAQQAAA DTGHSNQVSQ NYPIVQNIQG QMVHQAISPR 

TLNAWVKWE EKAFSPEVIP MFSALSEGAT PQDLNTMLNT VGGHQAAMQM 

LKETINEEAA EWDRVHPVHA GPIAPGQMRE PRGSDIAGTT STLQEQIGWM 

10 TNNPPIPVGE lYKRWIILGL NKIVRMYSPT SILDIRQGPK EPFRDYVDRF 

YKTLRAEQAS QEVKNWMTET LLVQMANPDC KTILKALGPA ATLEEMMTAC 

QGVGGPGHKA RVLMGPISPI ETVSVKLKPG MDGPKVKQWP LTEEKIKALV 

EICTEMEKEG KISKIGPEMP YNTPVFAIKK KDSTKWRKLV DFRELNKRTQ 

DFWEVQLGIP HPAGLKKKKS VTVLDVGDAY FSVPLDEDFR KYTAFTIPSI 

15 ISnSFETPGIRYQ YNVLPQGWKG SPAIFQSSMT KILEPFRKQN PDIVIYQYMD 

DLYVGSDLEI GQHRTKIEEL RQHLLRWGLT TPDKKHQKEP PFLWMGYELH 

PDKWTVQPIV LPEKDSWTVN DIQKLVGKLN WASQIYPGIK VRQLCKLLRG 

TKALTEVIPL TEEAELEIiAE NREILKEPVH GVYYDPSKDL lAEIQKQGQG 

QWTYQIYQEP FKNLKTGKYA RMRGAHTNDV KQLTEAVQKI TTESIVIWGK 

20 TPKFKLPIQK ETWETWWTEY WQATWIPEWE FVNTPPLVKL WYQLEKEPIV 

GAETFYVDGA ANRETKLGKA GYVTNRGRQK WTLTDTTNQ KTELQAIYLA 

LQDSGLEVNI VTDSQYALGI IQAQPDQSES ELVNQIIEQL IKKEKVYLAW 

VPAHKGIGGN EQVDKLVSAG IRKVLMVGFP VTPQVPLRPM TYKAAVDLSH 

FLKEKGGLEG LIHSQRRQDI LDLWIYHTQG YFPDWQNYTP GPGVRYPLTF 

25 GWCYKLVPVE PDKVEEANKG ENTSLLHPVS LHGMDDPERE VLEWRFDSRL 
AFHHVARELH PEYFKNC* 
[SEQ ID NO: 69] 
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Figure 12 

Sequence of the pi 7/p24opt/trNef insert in p73i-GN2: 

ATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGAGAGCTGGACAGATGGGAGAAAATT 
5 AGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGTGTGGGCCTCG 
AGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGT 
CGCCAGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGG 
TCCTTGTATAACACAGTGGCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAG 
GATACCAAGGAGGCCTTGGACAAAATTGAGGAGGAGCAAAACAAGAGCAAGAAGAAG 

10 GCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCACAGAACTATCCT 
ATTGTCCAAA?^CATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTC 
AATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATG 
TTCTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACC 
GTGGGCGGCCATCAGGCCGCCATGCAAATGTTGAAGGAGACTATCAACGAGGAGGCA 

15 GCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCGCCCGGACAGATG 
CGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATC 
GGATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATC 
ATTCTCGGTCTCAATAAAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATT 
AGACAGGGACCCAAAGAGCCTTTTAGGGATTACGTGGACCGGTTTTATAAGACCCTG 

20 CGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGAGACACTCCTGGTA 
CAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACC 
CTGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGA 
GTGTTGATGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTAC 
AAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTA 

25 ATTCACTCCCAAAGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGC 
TACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACC 
TTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAAT 
AAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGGATGGATGACCCG 
GAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCC 

30 CGAGAGCTGCATCCGGAGTACTTCAAGAACTGCTGA [SEQ ID NO : 70] 



MGARASVLSG GELDRWEKIR LRPGGKKKYK LKHIVWASRE LERFAVNPGL 
LETSEGCRQI LGQLQPSLQT GSEELRSLYN TVATLYCVHQ RIEIKDTKEA 

35 LDKIEEEQNK SKKKAQQAAA DTGHSNQVSQ NYPIVQNIQG QMVHQAISPR 
TLNAWVKWE EKAFSPEVIP MFSALSEGAT PQDLNTMLNT VGGHQAAMQM 
LKETINEEAA EWDRVHPVHA GPIAPGQMRE PRGSDIAGTT STLQEQIGWM 
TNNPPIPVGE lYKRWIILGL NKIVRMYSPT SILDIRQGPK EPFRDYVDRF 
YKTLRAEQAS QEVKNWMTET LLVQNANPDC KTILKALGPA ATLEEMMTAC 

40 QGVGGPGHKA RVLMVGFPVT PQVPLRPMTY KAAVDLSHFL KEKGGLEGLI 
HSQRRQDILD LiWIYHTQGYF PDWQNYTPGP GVRYPLTFGW CYKLVPVEPD 
KVEEANKGEN TSLLHPVSLiH GMDDPEREVL EWRFDSRLiAP HHVARELHPE 
YFKNC* [SEQ ID NO : 71] 
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4311,ApaLI. 



.H]ndlll,5B0 



40g9,Drdl., 

3997.BspUU11.. 
3928,BsrBl. 

3817.ECORI... 
3745.MIU1 



.Sspl,755 
/' EcoNi,765 



3528,HpaI,/ 
3518,Xmnl 



3385.BamHr..:< 
3365.Scaiy/ 
3360,BspE) / 
3340.PmaCI 




.CIaI,98B 
..Nrul.1023 



V..SphI,1238 
\Sse8387,1243 
Pstl.1244 

'*"Ndel,1390 
SnaBI,14SS 



3018.Bglll V 
2982,KpnI/ 
2982,Asp718 

2907.Ahdl 

2759.AgeI - 
2755.Sa1I; 
27S5>\ccl 
2727.SanDI 
2671,Bsal 

2575,NgoMli 
2575,Nael 



■'.Ed 138. 1721 
Sacl.1721 
Sacll.1809 
Notl.18S8 



Mlinl.2053 



SexAt,2243 
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Figure 13 

Sequence of RT w229k: 

ATGGGCCCCATCAGTCCCATCGAGACCGTGCCGGTGAAGCTGAAACCCGGGATGGACGGCCC 
5 CAAGGTCAAGCAGTGGCCACTCACCGAGGAGAAGATCAAGGCCCTGGTGGAGATCTGC^ 
AGATGGAGAAAGAGGGC7y^GATCAGCAAGATCGGGCCTGAGAACCCATA(^^ 
TTTGCCATCAAGAAGAAGGACAGCACCAAGTGGCGCAAGCTGGTGGATTTCCGGGAGCTGAA 
TAAGCGGACCCAGGATTTCTGGGAGGTCCAGCTGGGCATCCCCCATCCGGCCGGCCTGAAGA 
AGAAGAAGAGCGTGACCGTGCTGGACGTGGGCGACGCTTACTTCAGCGTCCCTCTGGACGAG 

10 GACTTTAGAAAGTACACCGCCTTTACCATCCCATCTATCAACAACGAGACCCCTGGCATCAG 
ATATCAGTACAACGTCCTCCCCCAGGGCTGGAAGGGCTCTCCCGCCATTTTCCAGAGCTCCA 
TGACCAAGATCCTGGAGCCGTTTCGGAAGCAGAACCCCGATATCGTCATCTACCAGTACATG 
GACGACCTGTACGTGGGCTCTGACCTGGAAATCGGGCAGCATCGCACGAAGATTGAGGAGCT 
GAGGCAGCATCTGCTGAGATGGGGCCTGACCACTCCGGACAAGAAGCATCAGAAGGAGCCGC 

15 CATTCCTgaaGATGGGCTACGAGCTCCATCCCGACAAGTGGACCGTGCAGCCTATCGTCCTC 
CCCGAGAAGGACAGCTGGACCGTGAACGACATCCAGAAGCTGGTGGGCAAGCTCAACTGGGC 
TAGCCAGATCTATCCCGGGATCAAGGTGCGCCAGCTCTGCAAGCTGCTGCGCGGCACCAAGG 
CCCTGACCGAGGTGATTCCCCTCACGGAGGAAGCCGAGCTCGAGCTGGCTGAGAACCGGGAG 
ATCCTGAAGGAGCCCGTGCACGGCGTGTACTATGACCCCTCCAAGGACCTGATCGCCGAAAT 

20 CCAGAAGCAGGGCCAGGGGCAGTGGACATACCAGATTTACCAGGAGCCTTTCAAGAACCTCA 
AGACCGGCAAGTACGCCCGCATGAGGGGCGCCCACACCAACGATGTCAAGCAGCTGACCGAG 
GCCGTCCAGAAGATCACGACCGAGTCCATCGTGATCTGGGGGAAGACACCCAAGTTCAAGCT 
GCCTATCCAGAAGGAGACCTGGGAGACGTGGTGGACCGAATATTGGCAGGCCACCTGGATTC 
CCGAGTGGGAGTTCGTGAATACACCTCCTCTGGTGAAGCTGTGGTACCAGCTCGAGAAGGAG 

25 CCCATCGTGGGCGCGGAGACATTCTACGTGGACGGCGCGGCCAACCGCGAAACAAAGCTCGG 
GAAGGCCGGGTACGTCACCAACCGGGGCCGCCAGAAGGTCGTCACCCTGACCGACACCACCA 
ACCAGAAGACGGAGCTGCAGGCCATCTATCTCGCTCTCCAGGACTCCGGCCTGGAGGTGAAC 
ATCGTGACGGACAGCCAGTACGCGCTGGGCATTATTCAGGCCCAGCCGGACCAGTCCGAGAG 
CGAACTGGTGAACCAGATTATCGAGCAGCTGATCAAGAAAGAGAAGGTCTACCTCGCCTGGG 
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TCCCGGCCCATAAGGGCATTGGCGGCAACGAGCAGGTCGACAAGCTGGTGAGTGCGGGGATT 
AGAAAGGTGCTGTAA [SEQ ID NO: 72] 



,StuI 



Hindin 



DrdI 



BspLUl 1 
BsrBI., 



EcoRI. 
Mlul.., 



Hpal/ 
XmnI 

BamHI . 
Sail/; 
BspMl'// 
SanDI / 
Bell' 




..SphI 
.•;:',Sse8387 



Ndcl 

/SnaBI 
*NcoI 



Kpnl/ 
Asp718 



i\ SacII 
\.NotI 
\ApaI 
Bspl20I 
Ball 



BstXI 



^Nael 
NgoMI 



Bsu36 



BspEI 
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.StuI 



PflMI 

.Hindlll 
.Sspl 
EcoNI 



AlwNI. 
ApaLI. 



Drdl. 
BspLUll. 
BsrBI. 
Sapl.. ' 
EcoRI. ' 
Miul.. 



Smal 
(Xmal 
,ClaI 
f.NruI 
" Xhol 



HpaL, 
Xmnl..v 

BamHI. 
BsaAI...ir 
Dram 



Bbsl 




.Drain 
/.SphI 
/iSse8387 
/|.PstI 
,BsaAI 
/.SnaBI 
I^NcoI 
i^.Styl 
'" MslI 



Ecll36-H- 
"Sacn-H- 



Smal-H- 



Gsul 
++BbsI 
BsaAI 

-H-Sspl / 
Bbsl 



Styl 

4+ Xmal 

■Bcll36" , 
AlwNI 
BsaAI 
PflMI 
Gsul 
Sad 
Ecll36 
EcoNI 



Stul++ 
Gsul 
Sapl 
\?iStyI 
\ WStuI 
\ \ styl 
\ AlwNI 
StuI 
MsU 



iSmal 
Xmal 
Styl 
MslI 
MslI 
Drain 



Sapl 



Sequence: 

ATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGAGAGCTGGACAGATGGGAGAAAATT 
AGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGTGTGGGCCTCG 
AGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGT 
CGCCAGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGG 
TCCTTGTATAACACAGTGGCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAG 
GATACCAAGGAGGCCTTGGACAAAATTGAGGAGGAGCAAAACAAGAGCAAGAAGAAG 
GCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCACAGAACTATCCT 
ATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTC 
AATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATG 
TTCTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACC 
GTGGGCGGCCATCAGGCCGCCATGCAAATGTTGAAGGAGACTATCAACGAGGAGGCA 
GCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCGCCCGGACAGATG 
CGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATC 
GGATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATC 
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ATCCTGGGCCTGAACAAGATCGTGCGCATGTACTCTCCGACATCCATCCTTGACATT 
AGACAGGGACCCAAAGAGCCTTTTAGGGATTACGTCGACCGGTTTTATAAGACCCTG 
CGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGAGACACTCCTGGTA 
CAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACC 
5 CTGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGA 
GTGTTGATGGGCCCCATCAGTCCCATCGAGACCGTGCCGGTGAAGCTGAAACCCGGG 
ATGGACGGCCCCAAGGTCAAGCAGTGGCCACTCACCGAGGAGAAGATCAAGGCCCTG 
GTGGAGATCTGCACCGAGATGGAGAAAGAGGGCAAGATCAGCAAGATCGGGCCTGAG 
AACCCATACAACACCCCCGTGTTTGCCATCAAGAAGAAGGACAGCACCAAGTGGCGC 

10 AAGCTGGTGGATTTCCGGGAGCTGAATAAGCGGACCCAGGATTTCTGGGAGGTCCAG 
CTGGGCATCCCCCATCCGGCCGGCCTGAAGAAGAAGAAGAGCGTGACCGTGCTGGAC 
GTGGGCGACGCTTACTTCAGCGTCCCTCTGGACGAGGACTTTAGAAAGTACACCGGC 
TTTACCATCCCATCTATCAACAACGAGACCCCTGGCATCAGATATCAGTACAACGTC 
CTCCCCCAGGGCTGGAAGGGCTCTCCCGCCATTTTCCAGAGCTCCATGACCAAGATC 

15 CTGGAGCCGTTTCGGAAGCAGAACCCCGATATCGTCATCTACCAGTACATGGACGAC 
CTGTACGTGGGCTCTGACCTGGAAATCGGGCAGCATCGCACGAAGATTGAGGAGCTG 
AGGCAGCATCTGCTGAGATGGGGCCTGACCACTCCGGACAAGAAGCATCAGAAGGAG 
CCGCCATTCCTgaaGATGGGCTACGAGCTCCATCCCGACAAGTGGACCGTGCAGCCT 
ATCGTCCTCCCCGAGAAGGACAGCTGGACCGTGAACGACATCCAGAAGCTGGTGGGC 

20 AAGCTCAACTGGGCTAGCCAGATCTATCCCGGGATCAAGGTGCGCCAGCTCTGCAAG 
CTGCTGCGCGGCACCAAGGCCCTGACCGAGGTGATTCCCCTCACGGAGGAAGCCGAG 
CTCGAGCTGGCTGAGAACCGGGAGATCCTGAAGGAGCCCGTGCACGGCGTGTACTAT 
GACCCCTCCAAGGACCTGATCGCCGAAATCCAGAAGCAGGGCCAGGGGCAGTGGACA 
TACCAGATTTACCAGGAGCCTTTCAAGAACCTCAAGACCGGCAAGTACGCCCGCATG 

25 AGGGGCGCCCACACCAACGATGTCAAGCAGCTGACCGAGGCCGTCCAGAAGATCACG 
ACCGAGTCCATCGTGATCTGGGGGAAGACACCCAAGTTCAAGCTGCCTATCCAGAAG 
GAGACCTGGGA6ACGTGGTGGACCGAATATTGGCAGGCCACCTGGATTCCCGAGTGG 
GAGTTCGTGAATACACCTCCTCTGGTGAAGCTGTGGTACCAGCTCGAGAAGGAGCCC 
ATCGTGGGCGCGGAGACATTCTACGTGGACGGCGCGGCCAACCGCGAAACAAAGCTC 

30 GGGAAGGCCGGGTACGTCACCAACCGGGGCCGCCAGAAGGTCGTCACCCTGACCGAC 
ACCACCAACCAGAAGACGGAGCTGCAGGCCATCTATCTCGCTCTCCAGGACTCCGGC 
CTGGAGGTGAACATCGTGACGGACAGCCAGTACGCGCTGGGCATTATTCAGGCCCAG 
CCGGACCAGTCCGAGAGCGAACTGGTGAACCAGATTATCGAGCAGCTGATCAAGAAA 
GAGAAGGTCTACCTCGCCTGGGTCCCGGCCCATAAGGGCATTGGCGGCAACGAGCAG 

35 GTCGACAAGCTGGTGAGTGCGGGGATTAGAAAGGTGCTGATGGTGGGTTTTCCAGTC 
ACACCTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCAC 
TTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAAAGAAGACAAGAT 
ATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTGATTGGCAGAACTAC 
ACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTA 

40 CCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTA 
CACCCTGTGAGCCTGCATGGGATGGATGACCCGGAGAGAGAAGTGTTAGAGTGGAGG 
TTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTC 
AAGAACTGCTGA [SEQ ID NO : 73] 

45 
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MGARASVLSG GELDRWEKIR LRPGGKKKYK LKHIVWASRE LERFAWPGL 
LETSEGCRQI LGQLQPSLQT GSEELRSLYN TVATLYCVHQ RIEIKDTKEA 
LDKIEEEQNK SKKKAQQAAA DTGHSNQVSQ NYPIVQNIQG QMVHQAISPR 
5 TLNAWVKWE EKAFSPEVIP MFSALSEGAT PQDLNTMLNT VGGHQAAMQM 
LKETINEEAA EWDRVHPVHA GPIAPGQMRE PRGSDIAGTT STLQEQIGWM 
TNNPPIPVGE lYKRWIILGL NKIVRMYSPT SILDIRQGPK EPFRDYVDRF 
YKTLRAEQAS QEVKNWMTET LLVQNANPDC KTILKALGPA ATLEEMMTAC 
QGVGGPGHKA RVLMGPISPI ETVSVKLKPG MDGPKVKQWP LTEEKIKALV 

10 EICTEMEKEG KISKIGPENP YNTPVFAIKK KDSTKWRKLV DFRELN"KRTQ 
DFWEVQLGIP HPAGLKKKKS VTVLDVGDAY FSVPLDEDFR KYTAFTIPSI 
NNETPGIRYQ YNVLPQGWKG SPAIFQSSMT KILEPFRKQN PDIVIYQYMD 
DLYVGSDLEI GQHRTKIEEL RQHLLRWGLT TPDKKHQKEP PPLWMGYELH 
PDKWTVQPIV LPEKDSWTVN DIQKLVGKLN WASQIYPGIK VRQLCKLLRG 

15 TKALTEVIPL TEEAELELAE NREILKEPVH GVYYDPSKDL lAEIQKQGQG 
QWTYQIYQEP FKNLKTGKYA RMRGAHTNDV KQLTEAVQKI TTESIVIWGK 
TPKFKLPIQK ETWETWWTEY WQATWIPEWE FVNTPPLVKL WYQLEKEPIV 
GAETFYVDGA ANRETKLGKA GYVTNRGRQK WTLTDTTNQ KTELQAIYLA 
LQDSGLEVNI VTDSQYALGI IQAQPDQSES ELVNQIIEQL IKKEKVYLAW 

20 VPAHKGIGGN EQVDKLVSAG IRKVLMVGFP VTPQVPLRPM TYKAAVDLSH 
FLKEKGGLEG LIHSQRRQDI LDLWIYHTQG YFPDWQNYTP GPGVRYPLTF 
GWCYKLVPVE PDKVEEANKG ENTSLLHPVS LHGMDDPERE VLEWRFDSRL 
AFHHVARELH PEYFKNC* 
[SEQ ID NO: 74] 

25 



30 



35 
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Figure 15 
Sequence of Tnrg: 

ATGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAGGCA 

GCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCAC 

TCCCAAAGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTC 

CCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGA 

TGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGA 

GAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGGATGGATGACCCGGAGAGA 

GAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGAGAG 

CTGCATCCGGAGTACTTCAAGAACTGCATGGGCCCCATCAGTCCCATCGAGACCGTG 

CCGGTGAAGCTGAAACCCGGGATGGACGGCCCCAAGGTCAAGCAGTGGCCACTCACC 

GAGGAGAAGATCAAGGCCCTGGTGGAGATCTGCACCGAGATGGAGAAAGAGGGCAAG 

ATCAGCAAGATCGGGCCTGAGAACCCATACAACACCCCCGTGTTTGCCATCAAGAAG 

AAGGACAGCACCAAGTGGCGCAAGCTGGTGGATTTCCGGGAGCTGAATAAGCGGACC 

GAGGATTTCTGGGAGGTCCAGCTGGGCATCCCCCATCCGGCCGGCCTGAAGAAGAAG 

AAGAGCGTGACCGTGCTGGACGTGGGCGACGCTTACTTCAGCGTCCCTCTGGACGAG 

GACTTTAGAAAGTACACCGCCTTTACCATCCCATCTATCAACAACGAGACCCCTGGC 

ATCAGATATCAGTACAA.CGTCCTCCCCCAGGGCTGGAAGGGCTCTCCCGCCATTTTC 

CAGAGCTCCATGACCAAGATCCTGGAGCCGTTTCGGAAGCAGAACCCCGATATCGTC 

ATCTACCAGTACATGGACGACCTGTACGTGGGCTCTGACCTGGAAATCGGGCAGCAT 

CGCACGAAGATTGAGGAGCTGAGGCAGCATCTGCTGAGATGGGGCCTGACCACTCCG 

GACAAGAAGCATCAGAAGGAGCCGCCATTCCTgaaGATGGGCTACGAGCTCCATCCC 

GACAAGTGGACCGTGCAGCCTATCGTCCTCCCCGAGAAGGACAGCTGGACCGTGAAC 

GACATCCAGAAGCTGGTGGGCAAGCTCAACTGGGCTAGCCAGATCTATCCCGGGATC 

AAGGTGCGCCAGCTCTGCAAGCTGCTGCGCGGCACCAAGGCCCTGACCGAGGTGATT 

CCCCTCACGGAGGAAGCCGAGCTCGAGCTGGCTGAGAACCGGGAGATCCTGAAGGAG 

CCCGTGCACGGCGTGTACTATGACCCCTCCAAGGACCTGATCGCCGAAATCCAGAAG 

CAGGGCCAGGGGCAGTGGACATACCAGATTTACCAGGAGCCTTTCAAGAACCTCAAG 

ACCGGCAAGTACGCCCGCATGAGGGGCGCCCACACCAACGATGTCAAGCAGCTGACC 

GAGGCCGTCCAGAAGATCACGACCGAGTCCATCGTGATCTGGGGGAAGACACCCAAG 

TTCAAGCTGCCTATCCAGAAGGAGACCTGGGAGACGTGGTGGACCGAATATTGGCAG 

GCCACCTGGATTCCCGAGTGGGAGTTCGTGAATACACCTCCTCTGGTGAAGCTGTGG 

TACCAGCTCGAGAAGGAGCCCATCGTGGGCGCGGAGACATTCTACGTGGACGGCGCG 

GCCAACCGCGAAACAAAGCTCGGGAAGGCCGGGTACGTCACCAACCGGGGCCGCCAG 

AAGGTCGTCACCCTGACCGACACCACCAACCAGAAGACGGAGCTGCAGGCCATCTAT 

CTCGCTCTCCAGGACTCCGGCCTGGAGGTGAACATCGTGACGGACAGCCAGTACGCG 

CTGGGCATTATTCAGGCCCAGCCGGACCAGTCCGAGAGCGAACTGGTGAACCAGATT 

ATCGAGCAGCTGATCAAGAAAGAGAAGGTCTACCTCGCCTGGGTCCCGGCCCATAAG 

GGCATTGGCGGCAACGAGCAGGTCGACAAGCTGGTGAGTGCGGGGATTAGAAAGGTG 

CTGATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGAGAGCTGGACAGATGGGAGAAA 

ATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGTGTGGGCC 

TCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGA 

TGTCGCCAGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTG 

AGGTCCTTGTATAACACAGTGGCTACCCTCTACTGCGTACACCAGAGGATCGAGATT 
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AAGGATACCAAGGAGGCCTTGGACAAAATTGAGGAGGAGCAAAACAAGAGCAAGAAG 
AAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCACAGAACTAT 
CCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACG 
CTCAATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCC 
5 ATGTTCTCCGCTTTGAGTGAGGGGGCCAGTCCTCAGGACCTCAATACAATGCTTAAT 
ACCGTGGGCGGCCATCAGGCCGCCATGCAAATGTTGAAGGAGACTATCAACGAGGAG 
GCAGCCGAGTGGGACAGAGTGCATCCCGTCCAGGCTGGCCCAATCGCGCCCGGACAG 
ATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAA 
ATCGGATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGG 

10 ATCATCCTGGGCCTGAACAAGATCGTGCGCATGTACTCTCCGACATCCATCCTTGAC 
ATTAGACAGGGACCCAAAGAGCCTTTTAGGGATTACGTCGACCGGTTTTATAAGACC 
CTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGAGACACTCCTG 
GTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCC 
ACCCTGGAAGAGATGATGACCGGCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCC 

15 AGAGTGTTGTGA [SEQ ID NO: 75] 



MVGFPVTPQV PLRPMTYKAA 
YHTQGYFPDW QNYTPGPGVR 

20 LHPVSLHGMD DPEREVLEWR 
TVSVKLKPGM DGPKVKQWPL 
NTPVFAIKKK DSTKWRKLVD 
TVLDVGDAYF SVPLDEDFRK 
PAIFQSSMTK ILEPFRKQNP 

25 QHLLRWGLTT PDKKHQKEPP 
IQKLVGKLNW ASQIYPGIKV 
REILKEPVHG VYYDPSKDLI 
MRGAHTNDVK QLTEAVQKIT 
QATWIPEWEF VNTPPLVKLW 

30 YVTNRGRQKV VTLTDTTNQK 
QAQPDQSESE LVNQIIEQLI 
RKVLMGARAS VLSGGELDRW 
NPGLLiETSEG CRQILGQLQP 
TKEALDKIEE EQNKSKKKAQ 

35 ISPRTLNAWV KWEEKAFSP 
AMQMLKETIN EEAAEWDRVH 
IGWMTNNPPI PVGEIYKRWI 
VDRFYKTLRA EQASQEVKNW 



VDLSHFLKEK GGLEGLIHSQ RRQDILDLWI 
YPLTFGWCYK LVPVEPDICVE EANKGENTSL 
FDSRLAFHHV ARELHPEYFK NCMGPISPIE 
TEEKIKAIiVE ICTEMEKEGK ISKIGPENPY 
FRELNKRTQD FWEVQLGIPH PAGLKKKKSV 
YTAFTIPSIN NETPGIRYQY ISTVLPQGWKGS 
DIVIYQYMDD LYVGSDLEIG QHRTKIEELR 
FLWMGYELHP DKWTVQPIVL PEKDSWTVND 
RQLCKLLRGT KALiTEVIPLT EEAELELAEN 
AEIQKQGQGQ WTYQIYQEPF KNLKTGKYAR 
TESIVIWGKT PKFKLPIQKE TWETWWTEYW 
YQLEKEPIVG AETFYVDGAA NRETKLGKAG 
TELQAIYLAL QDSGLEVNIV TDSQYALGII 
KKEKVYLAWV PAHKGIGGNE QVDKLVSAGI 
EKIRLRPGGK KKYKLKHIVW ASRELERFAV 
SLQTGSEELR SLYNTVATLY CVHQRIEIKD 
QAAADTGHSN QVSQNYPIVQ NIQGQMVHQA 
EVIPMFSALS EGATPQDLNT MLNTVGGHQA 
PVHAGPIAPG QMREPRGSDI AGTTSTLQEQ 
ILGLNKIVRM YSPTSILDIR QGPKEPFRDY 
MTETLLVQNA NPDGKTILKA LGPAATLEEM 
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MTACQGVGGP GHKARVL* [SEQ ID NO: 76] 
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Figure 16 

Sequence of the Tngr insert in p7313ie: 

ATGGTGGGTTTTCCAGTCAGACCTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGT 
5 AGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAAAGAA 
GACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTGATTGGCAGAAC 
TACACACCAGGGCCAGGGGTCAGATATCGACTGACCTTTGGATGGTGCTACAAGCTAGTACC 
AGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTG 
TGAGCCTGCATGGGATGGATGACCCGGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGC 

10 CTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCATGGGTGC 
CCGAGCTTCGGTACTGTCTGGTGGAGAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGG 
GAGGCAAAAAGAAATACAAGCTCAAGCATATCGTGTGGGCCTCGAGGGAGCTTGAACGGTTT 
GCCGTGAACCCA.GGCCTGCTGGAAACATCTGAGGGATGTCGCCAGATCCTGGGGCAATTGCA 
GCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTGGCTACCCTCT 

1 5 ACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGAG 
GAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCA 
GGTATCACAGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCA 
GCCCCCGGACGCTCAATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTT 
ATCCCCATGTTCTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGACCTCAATACAATGCTTAA 

20 TACCGTGGGCGGCCATCAGGCCGCCATGCAAATGTTGAAGGAGACTATCAACGAGGAGGCAG 
CCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCGCCCGGACAGATGCGGGAG 
CCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGGATGGATGAC 
CAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCATCCTGGGCCTGAACA 
AGATCGTGCGCATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCT 

25 TTTAGGGATTACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGT 
CAAAAACTGGATGACGGAGACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCT 
TGAAGGCACTAGGCCCGGCTGCCACCCTGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGG 
GGACCCGGACACAAAGCCAGAGTGTTGATGGGCCCCATCAGTCCCATCGAGACCGTGCCGGT 
GAAGCTGAAACCCGGGATGGACGGCCCCAAGGTCAAGCAGTGGCCACTCACCGAGGAGAAGA 

30 TCAAGGCCCTGGTGGAGATCTGCACCGAGATGGAGAAAGAGGGCAAGATCAGCAAGATCGGG 
CCTGAGAACCCATACAACACCCCCGTGTTTGCCATCAAGAAGAAGGACAGCACCAAGTGGCG 
CAAGCTGGTGGATTTCCGGGAGCTGAATAAGCGGACCCAGGATTTCTGGGAGGTCCAGCTGG 
GCATCCCCCATCCGGCCGGCCTGAAGAAGAAGAAGAGCGTGACCGTGCTGGACGTGGGCGAC 
GCTTACTTCAGCGTCCCTCTGGACGAGGACTTTAGAAAGTACACCGCCTTTACCATCCCATC 

35 TATCAACAACGAGACCCCTGGCATCAGATATCAGTACAACGTCCTCCCCCAGGGCTGGAAGG 
GCTCTCCCGCCATTTTCCAGAGCTCCATGACCAAGATCCTGGAGCCGTTTCGGAAGCAGAAC 
CCCGATATCGTCATCTACCAGTACATGGACGACCTGTACGTGGGCTCTGACCTGGAAATCGG 
GCAGCATCGCACGAAGATTGAGGAGCTGAGGCAGCATCTGCTGAGATGGGGCCTGACCACTC 
CGGACAAGAAGCATCAGAAGGAGCCGCCATTCCTgaaGATGGGCTACGAGCTCGATCCCGAC 

40 AAGTGGACCGTGCAGCCTATCGTCCTCCCCGAGAAGGACAGCTGGACCGTGAACGACATCCA 
GAAGCTGGTGGGCAAGCTCAACTGGGCTAGCCAGATCTATCCCGGGATCAAGGTGCGCCAGC 
TCTGCAAGCTGCTGCGCGGCACCAAGGCCCTGACCGAGGTGATTCCCCTCACGGAGGAAGCC 
GAGCTCGAGCTGGCTGAGAACCGGGAGATCCTGAAGGAGCCCGTGCACGGCGTGTACTATGA 
CCCCTCCAAGGACCTGATCGCCGAAATCCAGAAGCAGGGCCAGGGGCAGTGGACATACCAGA 

45 TTTACCAGGAGCCTTTCAAGAACCTCAAGACCGGCAAGTACGCCCGCATGAGGGGCGCCCAC 
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ACCAACGATGTCAAGCAGCTGACCGAGGCCGTCCAGAAGATCACGACCGAGTCCATCGTGAT 
CTGGGGGAAGACACCCAAGTTCAAGCTGCCTATCCAGAAGGAGACCTGGGAGACGTGGTGGA 
CCGAATATTGGCAGGCCACCTGGATTCCCGAGTGGGAGTTCGTGAATACACCTCCTCTGGTG 
AAGCTGTGGTACCAGCTCGAGAAGGAGCCCATCGTGGGCGCGGAGACATTCTACGTGGACGG 
5 CGCGGCCA^CCGCGAAACAAAGCTCGGGAAGGCCGGGTACGTCA.CCAACCGGGGCCGCCAGA 
AGGTCGTCACCCTGACCGACACCACCAACCAGAAGACGGAGCTGCAGGCCATCTATCTCGCT 
CTCCAGGACTCCGGCCTGGAGGTGAACATCGTGACGGACAGCCAGTACGCGCTGGGCATTAT 
TCAGGCCCAGCCGGACCAGTCCGAGAGCGAACTGGTGAACCAGATTATCGAGCAGCTGATCA 
AGAAAGAGAAGGTCTACCTCGCCTGGGTCCCGGCCCATAAGGGCATTGGCGGCAACGAGCAG 
10 GTCGACAAGCTGGTGAGTGCGGGGATTAGAAAGGTGCTGTAA [SEQ ID NO : 77] 



MVGFPVTPQV PLRPMTYKAA VDLSHFLKEK GGLEGLIHSQ RRQDILDLWI 
YHTQGYFPDW QNYTPGPGVR YPLTFGWCYK LVPVEPDKVE EANKGENTSL 

15 liHPVSLHGMD DPEREVLEWR FDSRLAFHHV ARELHPEYFK NCMGARASVL 
SGGELDRWEK IRLRPGGKKK YKLKHIVWAS RELERFAVNP GLLETSEGCR 
QILGQLQPSL QTGSEELRSL YNTVATLYCV HQRIEIKDTK EALDKIEEEQ 
NKSKKKAQQA AADTGHSNQV SQNYPIVQNI QGQMVHQAIS PRTLNAWVKV 
VEEKAFSPEV IPMFSALSEG ATPQDLNTML NTVGGHQAAM QMLKETINEE 

20 AAEWDRVHPV HAGPIAPGQM REPRGSDIAG TTSTLQEQIG WMTNNPPIPV 
GEIYKRWIIL GLNKIVRMYS PTSILDIRQG PKEPFRDYVD RFYKTLRAEQ 
ASQEVKNWMT ETLLVQNANP DCKTILKALG PAATLEEMMT ACQGVGGPGH 
KARVLMGPIS PIETVSVKLK PGMDGPKVKQ WPLTEEKIKA LVEICTEMEK 
EGKISKIGPE NPYNTPVFAI KKKDSTKWRK LVDFRELNKR TQDFWEVQLG 

25 IPHPAGLKKK KSVTVLDVGD AYFSVPLDED FRKYTAFTIP SIISINETPGIR 
YQYNVLPQGW KGSPAIFQSS MTKILEPFRK QNPDIVIYQY MDDLYVGSDL 
EIGQHRTKIE ELRQHLLRWG LTTPDKKHQK EPPFLWMGYE LHPDKWTVQP 
IVLPEKDSWT VNDIQKLVGK LNWASQIYPG IKVRQLCKLL RGTKALTEVI 
PLTEEAELEL AENREILKEP VHGVYYDPSK DLIAEIQKQG QGQWTYQIYQ 

30 EPFKNLKTGK YARMRGAHTN DVKQLTEAVQ KITTESIVIW GKTPKFKLPI 
QKETWETWWT EYWQATWIPE WEFVNTPPLV KLWYQLEKEP IVGAETFYVD 
GAANRETKLG KAGYVTNRGR QKWTLTDTT NQKTELQAIY IiALQDSGLEV 
NIVTDSQYAL GIIQAQPDQS ESELVNQIIE QLIKKEKVYL AWVPAHKGIG 
GNEQVDKLVS AGIRKVL* 

35 [SEQ ID NO: 78] 



40 
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Figure 17 

Sequence of insert Trgn #6 

ATGGGCCCCATCAGTCCCATCGAGACCGTGCCGGTGAAGCTGAAACCCGGGATGGAC 

GGCCCCAAGGTCAAGCAGTGGCCACTCACCGAGGAGAAGATCAAGGCCCTGGTGGAG 

ATCTGCACCGAGATGGAGAAAGAGGGCAAGATCAGCAAGATCGGGCCTGAGAACCCA 

TACAACACCCCCGTGTTTGCCATCAAGAAGAAGGACAGCACCAAGTGGCGCAAGCTG 

GTGGATTTCCGGGAGCTGAATAAGCGGACCCAGGATTTCTGGGAGGTCCAGCTGGGC 

ATCCCCCATCCGGCCGGCCTGAAGAAGAAGAAGAGCGTGACCGTGCTGGACGTGGGC 

GACGCTTACTTCAGCGTCCCTCTGGACGAGGACTTTAGAAAGTACACCGCCTTTACC 

ATCCCATCTATCAACAACGAGACCCCTGGCATCAGATATCAGTACAACGTCCTCCCC 

CAGGGCTGGAAGGGCTCTCCCGCCATTTTCCAGAGCTCCATGACCAAGATCCTGGAG 

CCGTTTCGGAAGCAGAACCCCGATATCGTCATCTACCAGTACATGGACGACCTGTAC 

GTGGGCTCTGACCTGGAAATCGGGCAGCATCGCACGAAGATTGAGGAGCTGAGGCAG 

CATCTGCTGAGATGGGGCCTGACCACTCCGGACAAGAAGCATCAGAAGGAGCCGCCA 

TTCCTgaaGATGGGCTACGAGCTCCATCCCGACAAGTGGACCGTGCAGCCTATCGTC 

CTCCCCGAGAAGGACAGCTGGACCGTGAACGACATCCAGAAGCTGGTGGGCAAGCTC 

AACTGGGCTAGCCAGATCTATCCCGGGATCAAGGTGCGCCAGCTCTGCAAGCTGCTG 

CGCGGCACCAAGGCCCTGACCGAGGTGATTCCCCTCACGGAGGAAGCCGAGCTCGAG 

CTGGCTGAGAACCGGGAGATCCTGAAGGAGCCCGTGCACGGCGTGTACTATGACCCC 

TCCAAGGACCTGATCGCCGAAATCCAGAAGCAGGGCCAGGGGCAGTGGACATACCAG 

ATTTACCAGGAGCCTTTCAAGAACCTCAAGACCGGCAAGTACGCCCGCATGAGGGGC 

GCCCACACCAACGATGTCAAGCAGCTGACCGAGGCCGTCCAGAAGATCACGACCGAG 

TCCATCGTGATCTGGGGGAAGACACCCAAGTTCAAGCTGCCTATCCAGAAGGAGACC 

TGGGAGACGTGGTGGACCGAATATTGGCAGGCCACCTGGATTCCCGAGTGGGAGTTC 

GTGAATACACCTCCTCTGGTGAAGCTGTGGTACCAGCTCGAGAAGGAGCCCATCGTG 

GGCGCGGAGACATTCTACGTGGACGGCGCGGCCAACCGCGAAACAAAGCTCGGGAAG 

GCCGGGTACGTCACCAACCGGGGCCGCCAGAAGGTCGTCACCCTGACCGACACCACC 

AACCAGAAGACGGAGCTGCAGGCCATCTATCTCGCTCTCCAGGACTCCGGCCTGGAG 

GTGAACATCGTGACGGACAGCCAGTACGCGCTGGGCATTATTCAGGCCCAGCCGGAC 

CAGTCCGAGAGCGAACTGGTGAACCAGATTATCGAGCAGCTGATCAAGAAAGAGAAG 

GTCTACCTCGCCTGGGTCCCGGCCCATAAGGGCATTGGCGGCAACGAGCAGGTCGAC 

AAGCTGGTGAGTGCGGGGATTAGAAAGGTGCTGATGGGTGCCCGAGCTTCGGTACTG 

TCTGGTGGAGAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAG 

AAATACAAGCTCAAGCATATCGTGTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTG 

AACCCAGGGCTGCTGGAAACATCTGAGGGATGTCGCCAGATCCTGGGGCAATTGCAG 

CCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTGGCTACC 

CTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAA 

ATTGAGGAGGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACT 

GGGCATAGCAACCAGGTATCACAGAACTATCCTATTGTCCAAAACATTCAGGGCCAG 

ATGGTTCATCAGGCCATCAGCCCCCGGACGCTCAATGCCTGGGTGAAGGTTGTCGAA 

GAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTTCTCCGCTTTGAGTGAGGGGGCC 

ACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATG 
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CAAATGTTGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCC 
GTCCACGCTGGCCCAATCGCGCCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATT 
GCCGGCACCACCTCTACACTGCAAGAGCAAATCGGATGGATGACCAACAATCCTCCC 
ATCCCAGTTGGAGAAATCTATAAACGGTGGATCATCCTGGGCCTGAACAAGATCGTG 
5 CGCATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCTTTT 
AGGGATTACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAG 
GTCAAAAACTGGATGACGGAGACACTCCTGGTACAGAACGCTAACCCCGACTGCAAA 
ACAATCTTGAAGGCACTAGGCCCGGCTGCCACCCTGGAAGAGATGATGACCGCCTGT 
CAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGATGGTGGGTTTTCCAGTC 

10 ACACCTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCAC 
TTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAAAGAAGACAAGAT 
ATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTGATTGGCAGAACTAC 
ACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTA 
CCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTA 

15 CACCCTGTGAGCCTGCATGGGATGGATGACCCGGAGAGAGAAGTGTTAGAGTGGAGG 
TTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTC 

AAGAACTGCTGA [SEQ ID NO : 79] 



20 MGPISPIETV SVKLKPGMDG PKVKQWPLTE EKIKALVEIC TEMEKEGKIS 
KIGPENPYNT PVFAI KKKDS TKWRKLVDFR ELNKRTQDFW EVQLGIPHPA 
GLKKKKSVTV LDVGDAYFSV PLDEDPRKYT AFTIPSINNE TPGIRYQYNV 
LPQGWKGSPA IFQSSMTKIL EPFRKQNPDI VIYQYMDDLY VGSDLEIGQH 
RTKIEELRQH LLRWGLTTPD KKHQKEPPFL WMGYELHPDK WTVQPIVLPE 

25 KDSWTVNDIQ KLVGKLNWAS QIYPGIKVRQ LCKLLRGTKA LTEVIPLTEE 
AELELAENRE ILKEPVHGVY YDPSKDLIAE IQKQGQGQWT YQIYQEPFKN 
LKTGKYARMR GAHTNDVKQL TEAVQKITTE SIVIWGKTPK FKLPIQKETW 
ETWWTEYWQA TWIPEWEFVN TPPLVKLWYQ LEKEPIVGAE TFYVDGAANR 
ETKLGKAGYV TNRGRQKWT LTDTTNQKTE LQAIYLALQD SGLEVNIVTD 

30 SQYAIiGIIQA QPDQSESELV NQIIEQLIKK EKVYLAWVPA HKGIGGNEQV 
DKLVSAGIRK VLMGARASVL SGGELDRWEK IRLRPGGKKK YKLKHIVWAS 
REIiERFAVNP GLLETSEGCR QILGQLQPSL QTGSEELRSL YNTVATLYCV 
HQRIEIKDTK EALDKIEEEQ NKSKKKAQQA AADTGHSNQV SQNYPIVQNI 
QGQMVHQAIS PRTLNAWVKV VEEKAFSPEV IPMFSALSEG ATPQDLNTML 

35 NTVGGHQAAM QMLKETINEE AAEWDRVHPV HAGPIAPGQM REPRGSDIAG 
TTSTLQEQIG WMTNNPPIPV GEIYKRWIIL GLNKIVRMYS PTSILDIRQG 
PKEPFRDYVD RFYKTLRAEQ ASQEVKNWMT ETLLVQNANP DCKTILKALG 
PAATLEEMMT ACQGVGGPGH KARVLMVGFP VTPQVPLRPM TYKAAVDLSH 
FLKEKGGLEG LIHSQRRQDI LDLWIYHTQG YFPDWQNYTP GPGVRYPLTF 
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GWCYKLVPVE PDRVEEAWKG ENTSLLHPVS LHGMDDPERE VLEWRFDSRL 
AFHHVARELH PEYFKNC * [SEQ ID NO : 80] 
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Figure 18 

Sequence of insert of Trng #11 

ATGGGGCCCATCAGTCCCATCGAGACCGTGCCGGTGAAGCTGAAACCCGGGATGGAC 
5 GGCCCCAAGGTCAAGCAGTGGCCACTCACCGAGGAGAAGATCAAGGCCCTGGTGGAG 
ATCTGCACCGAGATGGAGAAAGAGGGCAAGATCAGCAAGATCGGGCCGGAGAACCCA 
TACAACACCCCCGTGTTTGCCATCAAGAAGAAGGACAGCACCAAGTGGCGCAAGCTG 
GTGGATTTCCGGGAGCTGAATAAGCGGACCCAGGATTTCTGGGAGGTCCAGCTGGGC 
ATCCCCCATCCGGCCGGCCTGAAGAAGAAGAAGAGCGTGACCGTGCTGGACGTGGGC 

10 GACGCTTACTTCAGCGTCCCTCTGGACGAGGACTTTAGAAAGTACACCGCCTTTACC 
ATCCCATCTATCAACAACGAGACCCCTGGCATCAGATATCAGTACAACGTCCTCCCC 
CAGGGCTGGAAGGGCTCTCCCGCCATTTTCCAGAGCTCCATGACCAAGATCCTGGAG 
CCGTTTCGGAAGCAGAACCCCGATATCGTCATCTACCAGTACATGGACGACCTGTAC 
GTGGGCTCTGACCTGGAAATCGGGCAGCATCGCACGAAGATTGAGGAGCTGAGGCAG 

15 CATCTGCTGAGATGGGGCCTGACCACTCCGGACAAGAAGCATCAGAAGGAGCCGCCA 
TTCCTgaaGATGGGCTACGAGCTCCATCCCGACAAGTGGACCGTGCAGCCTATCGTC 
CTCCCCGAGAAGGACAGCTGGACCGTGAACGACATCCAGAAGCTGGTGGGCAAGCTC 
AACTGGGCTAGCCAGATCTATCCCGGGATCAAGGTGCGCCAGCTCTGCAAGCTGGTG 
CGCGGCACCAAGGCCCTGACCGAGGTGATTCCCCTCACGGAGGAAGCCGAGCTCGAG 

20 CTGGCTGAGAACCGGGAGATCCTGAAGGAGCCCGTGCACGGCGTGTACTATGACCCC 
TCCAAGGACCTGATCGCCGAAATCCAGAAGCAGGGCCAGGGGCAGTGGACATACCAG 
ATTTACCAGGAGCCTTTCAAGAACCTCAAGACCGGCAAGTACGCCGGCATGAGGGGC 
GCCCACACCAACGATGTCAAGCAGCTGACCGAGGCCGTCCAGAAGATCACGACCGAG 
TCCATCGTGATCTGGGGGAAGACACCCAAGTTCAAGCTGCCTATCCAGAAGGAGACC 

25 TGGGAGACGTGGTGGACCGAATATTGGCAGGCCACCTGGATTCCCGAGTGGGAGTTC 
GTGAATACACCTCCTCTGGTGAAGCTGTGGTACCAGCTCGAGAAGGAGCCCATCGTG 
GGCGCGGAGACATTCTACGTGGACGGCGCGGCCAACCGCGAAACAAAGCTCGGGAAG 
GCCGGGTACGTCACCAACCGGGGCCGCCAGAAGGTCGTCACCCTGACCGACACCACC 
AACCAGAAGACGGAGCTGCAGGCCATCTATCTCGCTCTCCAGGACTCCGGCCTGGAG 

30 GTGAACATCGTGACGGACAGCCAGTACGCGCTGGGCATTATTCAGGCCCAGCCGGAC 
CAGTCCGAGAGCGAACTGGTGAACCAGATTATCGAGCAGCTGATCAAGAAAGAGAAG 
GTCTACCTCGCCTGGGTCCCGGCCCATAAGGGCATTGGCGGCAACGAGCAGGTCGAC 
AAGCTGGTGAGTGCGGGGATTAGAAAGGTGCTGATGGTGGGTTTTCCAGTCACACCT 
CAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTA 

35 AAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAAAGAAGACAAGATATCCTT 
GATCTGTGGATCTACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCA 
GGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTT 
GAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCT 
GTGAGCCTGCATGGGATGGATGACCCGGAGAGAGAAGTGTTAGAGTGGAGGTTTGAC 

40 AGCCGCCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAAC 
TGCTGAATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGAGAGCTGGACAGATGGGAG 
AAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGTGTGG 
GCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAG 
GGATGTCGCCAGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAG 

45 CTGAGGTCCTTGTATAACACAGTGGCTACCCTCTACTGCGTACACCAGAGGATCGAG 
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ATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGAGGAGCAAAACAAGAGCAAG 
AAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCACAGAAC 
TATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGG 
ACGCTCAATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATC 
5 CCCATGTTCTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGACCTCAATACAATGCTT 
AATACCGTGGGCGGCCATCAGGCCGCCATGCAAATGTTGAAGGAGACTATCAACGAG 
GAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCGCCCGGA 
CAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAG 
CAAATCGGATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGG 

10 TGGATCATCCTGGGCCTGAACAAGATCGTGCGCATGTACTCTCCGACATCCATCCTT 
GACATTAGACAGGGACCCAAAGAGCCTTTTAGGGATTACGTCGACCGGTTTTATAAG 
ACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGAGACACTC 
CTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCT 
GCCACCCTGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAA 

15 GCCAGAGTGTTG [SEQ ID NO : 81] 



MGPISPIETV SVKLKPGMDG PKVKQWPLTE EKIKALVEIC TEMEKEGKIS 
KIGPENPYNT PVFAIKKKDS TKWRKLVDFR ELNKRTQDFW EVQLGIPHPA 
GLKKKKSVTV LDVGDAYFSV PLDEDFRKYT AFTIPSINNE TPGIRYQYNV 

20 LPQGWKGSPA IFQSSMTKIL EPFRKQNPDI VIYQYMDDLY VGSDLEIGQH 
RTKIEELRQH LLRWGLTTPD KKHQKEPPFL WMGYELHPDK WTVQPIVLPE 
KDSWTVNDIQ KLVGKLNWAS QIYPGIKVRQ LCKLLRGTKA LTEVIPLTEE 
AELELAENRE ILKBPVHGVY YDPSKDLIAE IQKQGQGQWT YQIYQEPFKN 
LKTGKYARMR GAHTNDVKQL TEAVQKITTE SIVIWGKTPK FKLPIQKETW 

25 ETWWTEYWQA TWIPEWEFVN TPPLVKLWYQ LEKEPIVGAE TFYVDGAANR 
ETKLGKAGYV TNRGRQKWT LTDTTNQKTE LQAIYIjALQD SGLEVNIVTD 
SQYALGIIQA QPDQSESELV NQIIEQLIKK EKVYLAWVPA HKGIGGNEQV 
DKLVSAGIRK VLMVGFPVTP QVPLRPMTYK AAVDLSHFLK EKGGLEGLIH 
SQRRQDILDL WIYHTQGYFP DWQNYTPGPG VRYPLTFGWC YKLVPVEPDK 

30 VEEANKGENT SLLHPVSLHG MDDPEREVLE WRFDSRLAFH HVARELHPEY 
FKNCMGARAS VLSGGELDRW EKIRLRPGGK KKYKLKHIVW ASRELERFAV 
NPGLLETSEG CRQILGQLQP SLQTGSEELR SLYNTVATLY cvhqrieikd 
TKEALDKIEE EQNKSKKKAQ QAAADTGHSN QVSQNYPIVQ NIQGQMVHQA 
ISPRTLNAWV KWEEKAFSP EVIPMFSALS EGATPQDLNT MLNTVGGHQA 

35 AMQMLKETIN EEAAEWDRVH PVHAGPIAPG QMREPRGSDI AGTTSTLQEQ 
IGWMTNNPPI PVGEIYKRWI ILGLNKIVRM YSPTSILDIR QGPKEPFRDY 
VDRFYKTLRA EQASQEVKNW MTETLLVQNA NPDCKTILKA LiGPAATLEEM 
MTACQGVGGP GHKARVL [SEQ ID NO : 82] 
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Figure 19 

Sequence of TgnR (Fl): 

ATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGAGAGCTGGACAGATGGGAGAAAATT 
5 AGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGTGTGGGCCTCG 
AGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGT 
CGCCAGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGG 
TCCTTGTATAACACAGTGGCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAG 
GATACCAAGGAGGCCTTGGACAAAATTGAGGAGGAGCAAAACAAGAGCAAGAAGAAG 

1 0 GCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCACAGAACTATCCT 
ATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTC 
AATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATG 
TTCTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACC 
GTGGGCGGCCATCAGGCCGCCATGCAAATGTTGAAGGAGACTATCAACGAGGAGGCA 

1 5 GCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCGCCCGGACAGATG 
CGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATC 
GGATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATC 
ATCCTGGGCCTGAACAAGATCGTGCGCATGTACTCTCCGACATCCATCCTTGACATT 
AGACAGGGACCCAAAGAGCCTTTTAGGGATTACGTCGACCGGTTTTATAAGACCCTG 

20 CGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGAGACACTCCTGGTA 
CAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACC 
CTGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGA 
GTGTTGATGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTAC 
AAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTA 

25 ATTCACTCCCAAAGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGC 
TACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACC 
TTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAAT 
AAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGGATGGATGACCCG 
GAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCC 

30 CGAGAGCTGCATCCGGAGTACTTCAAGAACTGCATGGGCCCCATCAGTCCCATCGAG 
ACCGTGCCGGTGAAGCTGAAACCCGGGATGGACGGCCCCAAGGTCAAGCAGTGGCCA 
CTCACCGAGGAGAAGATCAAGGCCCTGGTGGAGATCTGCACCGAGATGGAGAAAGAG 
GGCAAGATCAGCAAGATCGGGCCTGAGAACCCATACAACACCCCCGTGTTTGCCATC 
AAGAAGAAGGACAGCACCAAGTGGCGCAAGCTGGTGGATTTCCGGGAGCTGAATAAG 

35 CGGACCCAGGATTTCTGGGAGGTCCAGCTGGGCATCCCCCATCCGGCCGGCCTGAAG 
AAGAAGAAGAGCGTGACCGTGCTGGACGTGGGCGACGCTTACTTCAGCGTCCCTCTG 
GACGAGGACTTTAGAAAGTACACCGCCTTTACCATCCCATCTATCAACAACGAGACC 
CCTGGCATCAGATATCAGTACAACGTCCTCCCCCAGGGCTGGAAGGGCTCTCCCGCC 
ATTTTCCAGAGCTCCATGACCAAGATCCTGGAGCCGTTTCGGAAGCAGAACCCCGAT 

40 ATCGTCATCTACCAGTACATGGACGACCTGTACGTGGGCTCTGACCTGGAAATCGGG 
CAGCATCGCACGAAGATTGAGGAGCTGAGGCAGCATCTGCTGAGATGGGGCCTGACC 
ACTCCGGACAAGAAGCATCAGAAGGAGCCGCCATTCCTgaaGATGGGCTACGAGCTC 
CATCCCGACAAGTGGACCGTGCAGCCTATCGTCCTCCCCGAGAAGGACAGCTGGACC 
GTGAACGACATCCAGAAGCTGGTGGGCAAGCTCAACTGGGCTAGCCAGATCTATCCC 

45 GGGATCAAGGTGCGCCAGCTCTGCAAGCTGCTGCGCGGCACCAAGGCCCTGACCGAG 
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GTGATTCCCCTCACGGAGGAAGCCGAGCTCGAGCTGGCTGAGAACCGGGAGATCCTG 
AAGGAGCCCGTGCACGGCGTGTACTATGACCCCTCCAAGGACCTGATCGCCGAAATC 
CAGAAGCAGGGCCAGGGGCAGTGGACATACCAGATTTACCAGGAGCCTTTCAAGAAC 
CTCAAGACCGGCAAGTACGCCCGCATGAGGGGCGCCCACACCAACGATGTCAAGCAG 
5 CTGACCGAGGCCGTCCAGAAGATCACGACCGAGTCCATCGTGATCTGGGGGAAGACA 
CCCAAGTTCAAGCTGCCTATCCAGAAGGAGACCTGGGAGACGTGGTGGACCGAATAT 
TGGCAGGCCACCTGGATTCCCGAGTGGGAGTTCGTGAATACACCTCCTCTGGTGAAG 
CTGTGGTACCAGCTCGAGAAGGAGCCCATCGTGGGCGCGGAGACATTCTACGTGGAC 
GGCGCGGCCAACCGCGAAACAAAGCTCGGGAAGGCCGGGTACGTCACCAACCGGGGC 
10 CGCCAGAAGGTCGTCACCCTGACCGACACCACCAACCAGAAGACGGAGCTGCAGGCC 
ATCTATCTCGCTCTCCAGGACTCCGGCCTGGAGGTGAACATCGTGACGGACAGCCAG 
TACGCGCTGGGCATTATTCAGGCCCAGCCGGACCAGTCCGAGAGCGAACTGGTGAAC 
CAGATTATCGAGCAGCTGATCAAGAAAGAGAAGGTCTACCTCGCCTGGGTCCCGGCC 
CATAAGGGCATTGGCGGCAACGAGCAGGTCGACAAGCTGGTGAGTGCGGGGATTAGA 

15 AAGGTGCTGTAA [SEQ ID NO: 83] 

MGARASVLSG GELDRWEKIR LRPGGKKKYK LKHIVWASRE LERFAVNPGL 
LETSEGCRQI LGQLQPSLQT GSEELRSLYN TVATLYCVHQ RIEIKDTKEA 
LDKIEEEQNK SKKKAQQAAA DTGHSNQVSQ NYPIVQNIQG QMVHQAISPR 

20 TLNAWVKWE EKAFSPEVIP MFSALSEGAT PQDLNTMLNT VGGHQAAMQM 
L.KETINEEAA EWDRVHPVHA GPIAPGQMRE PRGSDIAGTT STLQEQIGWM 
TNNPPIPVGE lYKRWIILGL NKIVRMYSPT SILDIRQGPK EPFRDYVDRF 
YKTLRAEQAS QEVKNWMTET LLVQNANPDC KTILKALGPA ATLEEMMTAC 
QGVGGPGHKA RVLMVGFPVT PQVPLRPMTY KAAVDLSHFL KEKGGLEGLI 

25 HSQRRQDILD LWIYHTQGYF PDWQNYTPGP GVRYPLTFGW CYKLVPVEPD 
KVEEANKGEN TSLLHPVSLH GMDDPEREVL EWRFDSRLAF HHVARELHPE 
YFKNCMGPIS PIETVSVKLK PGMDGPKVKQ WPLTEEKIKA LVEICTEMEK 
EGKISKIGPE NPYNTPVFAI KKKDSTKWRK LVDFRELNKR TQDFWEVQLG 
IPHPAGLKKK KSVTVLDVGD AYFSVPLDED FRKYTAFTIP SINNETPGIR 

30 YQYNVLPQGW KGSPAIFQSS MTKILEPFRK QNPDIVIYQY MDDLYVGSDL 
EIGQHRTKIE ELRQHLLRWG LTTPDKKHQK EPPFLWMGYE LHPDKWTVQP 
IVLPEKDSWT VNDIQKLVGK LNWASQIYPG IKVRQIjCKLL RGTKALTEVI 
PLTEEAELEL AENREILKEP VHGVYYDPSK DLIAEIQKQG QGQWTYQIYQ 
EPFKNLKTGK YARMRGAHTN DVKQLTEAVQ KITTESIVIW GKTPKFKLPI 

35 QKETWETWWT EYWQATWIPE WEFVNTPPLV KLWYQLEKEP IVGAETFYVD 

GAANRETKLG KAGYVTNRGR qkwtltdtt nqktelqaiy lalqdsglev 

NIVTDSQYAL GIIQAQPDQS ESELVNQIIE QLIKKEKVYL AWVPAHKGIG 
GNEQVDKLVS AGIRKVL* 
[SEQ ID NO: 84] 
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Figure 23 

Sample of the CDS immune response that is detected in the mouse model following immunisation with 
the reformatted constructs. The constructs are comprised Gag, RT and Nef as a triple fusion. G is 
Gag, RisRT and N is Nef 

Responses to Gag peptide by IFN-gamma ELIspot on Day 35 (7 days post-boost) 
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Responses to Nef peptide by IFN-gamma ELIspot at Day 35 (7 days post-boost) 
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Figure 24 




